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HUMAN MONOCLONAL ANTIBODY 

Field of the Invention 

5 This invention relates to novel human monoclonal 

antibodies (mAbs) and to the genes encoding same. More 
specifically, this invention relates to human monoclonal 
antibodies specifically reactive with an epitope of the 
fusion (F) protein of Respiratory Syncytial Virus (RSV) . 
10 Such antibodies are useful for the therapeutic and/or 
prophylactic treatment of RSV infection in human 
patients, particularly infants and young children. 

Background of the Invention 

15 Respiratory syncytial virus (RSV) is the major 

cause of lower respiratory disease in children, giving 
rise to predictable annual epidemics of bronchiolitis 
and pneumonia in children worldwide. The virus is 
highly contagious, and infections can occur at any age. 

2 0 Comprehensive details concerning RSV infection and its 

clinical features can be obtained from excellent recent 
reviews by Mcintosh, K. and R. M. Chanock, In: 
"Respiratory Syncytial Virus", Ch. 38, B.N. Fields ed. , 
Raven Press (1990) and Hall, C.B., In: "Textbook of 
25 Pediatric Disease" Feigin and Cherry, eds . , W.B. 
Saunders, pgs 1247-1268 (1987). 

RSV is distributed worldwide. One of the most 
remarkable features of the epidemiology of RSV virus, as 
mentioned above, is the consistent pattern of infection 

3 0 and disease. Other respiratory viruses cause epidemics 

at irregular intervals or exhibit a mixed 
endemic /epidemic pattern, but RSV is the only 
respiratory viral pathogen that produces a sizable 
epidemic every year in large urban centers . In the 
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temperate areas of the world, RSV epidemics have 
occurred primarily in the late fall, winter or spring 
but never during the summer. The occurrence and spread 
of infection within a community is characteristic and 
5 easily diagnosed, leading to sharp rises in cases of 

bronchiolitis and pediatric pneumonia and the number of 
hospital admissions of young children with acute lower 
respiratory tract disease. Other respiratory viral 
agents that occur in outbreaks are rarely present at the 

10 same time as RSV. 

Primary RSV infection occurs in the very young. 
Zero to 2 year old infants are the most susceptible and 
represent the primary affected population. In this 
group, 1 out of 5 will develop lower respiratory (below 

15 larynx) disease upon infection and this ratio stays the 
same upon reinfection. By 1 year of age, 25-50% of 
infants have specific antibodies as a result of natural 
infection and this is close to 100% by age 4-5. Thus, 
virtually all children have been infected before they 

2 0 have entered school. 

Age, sex, socioeconomic and environmental factors 
can all influence the severity of disease. 
Hospitalization is required in 1-3% of cases of RSV 
infection and is usually of long duration (up to 3 

25 weeks) . The high morbidity of RSV infection, especially 
in infancy, has also been implicated in the development 
of respiratory problems later in life. With current 
intensive care in the U.S. and the other developed 
countries, overall mortality for normal subjects is low 

30 (less than 2% of hospitalized subjects) . However, 

mortality is much higher in less developed countries 
and, even in developed countries, mortality is high in 
certain risk groups such as in infants with underlying 
cardiac condition (cyanotic congenital heart disease) or 



9 



wo 00/69462 PCT/USOO/13694 

respiratory disease (bronchopulmonary dysplasia) where 

the progression of symptoms may be rapid. For instance, 
mortality in infants with cyanotic congenital heart 
disease has been reported to be as high as 37%. In 
5 premature infants apneic spells due to RSV infection may 
occur and, in rare cases, cause neurologic or systemic 
damage. Severe lower respiratory tract illness 
(bronchiolitis and pneumonia) is most common in patients 
under six months of age. Infants who have apparently 

10 recovered completely from this illness may display 
symptomatic respiratory abnormalities for years 
(recurrent wheezing, decreased pulmonary function, 
recurrent cough, asthma, and bronchitis). 

Immunity to RSV appears to be short-lived, thus 

15 reinfections are frequent. The mechanisms by which the 
immune system protects against RSV infection and 
reinfection are not well understood. It is clear, 
however, that immunity is only partially protective 
since reinfection is common at all ages, and sometimes 

2 0 occurs in infants only weeks after recovery from a 

primary infection. Both serum and secretory antibodies 
(IgA) have been detected in response to RSV infection in 
adults as well as in very young infants. However, the 
titers of serum antibodies to the viral F or G 
25 glycoprotein, as well as of neutralizing antibodies 

found in infants (1-8 months of age) are 15-25% of those 
found in older subjects. These reduced titers may 
contribute to the increased incidence of serious 
infection in younger children. 

3 0 Evidence for the role of serum antibodies in 

protection against RSV virus has emerged from 
epidemiological as well as animal studies . In adults 
exposed naturally to the virus, susceptibility 
correlated well with low serum antibody level . In 
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infants, titers of maternally transmitted antibodies 

correlate with resistance to serious disease [Glezen, 
W.P. et al., J. Pediatr . 98:708-715 (1981)]. Other 
studies show that the incidence and severity of lower 
5 respiratory tract involvement is diminished in the 

presence of high serum antibody [Mcintosh, K. et a.1 . , J . 
Infect. Pis . 138:24-32 (1978)] and high titers of 
passively administered serum neutralizing antibodies 
have been shown to be protective in a cotton rat model 

10 of RSV infection [Prince, G. A. et al . , Virus Res . 
3 : 193-206 (1985) ] . 

Children laclcing cell-mediated immunity are unable 
to overcome their infection and shed virus for many 
months in contrast to children with normal immune 

15 systems. Similarly, nude mice infected with RSV virus 
persistently shed virus. These mice can be cured by 
adoptive transfer of primed T cells [Cannon, M. J. et 
al., Immunology 52:133-138 (1987)]. 

In summary, it appears that both cellular and 

2 0 humoral immunity are involved in protection against 

infection, reinfection and RSV disease and that although 
antigenic variation is limited, protective immunity is 
not complete even after multiple exposures . 

RSV, belonging to the family paramyoxoviridae , is a 

2 5 negative-strand unsegmented RNA virus with properties 

similar to those of the paramyxoviruses. It has, 
however been placed in a separate genus Pneumovirus, 
based on morphologic differences and laclc of 
hemagglutinin and neuraminidase activities. RSV is 

3 0 pleomorphic and ranges in size from 15 0-300 nm in 

diameter. The virus matures by budding from the outer 
membrane of a cell and virions appear as membrane -bound 
particles with short, closely spaced projections or 
"spilces". The RNA genome encodes 10 unique viral 
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polypeptides ranging in size from 9.5 kDa to 160 kDa 

[Huang, Y. T. and G. W. Wertz, J. Virol . 43:150-157 
(1982)]. Seven proteins (F, N, P, L, M, M2 ) are 

present in RSV virions and at least three proteins (F, 
5 G, and SH) are expressed on the surface of infected 
cells. The F protein [SEQ ID NO : 20] has been 
conclusively identified as the protein responsible for 
cell fusion since specific antibodies to this protein 
inhibit syncytia formation in vitro and cells infected 

10 with vaccinia virus expressing recombinant F protein 
form syncytia in the absence of other RSV virus 
proteins. In contrast, antibodies to the G protein do 
not blocJc syncytia formation but prevent attaciiment of 
the virus to cells, 

15 RSV can be divided into two antigenically distinct 

subgroups, (A & B) [Mufson, M. A. et al . , J , Gen ' 1 . 
Virol ■ 66:2111-2124 (1985)]. This antigenic dimorphism 
is linlced primarily to the surface attaciiment (G) 
glycoprotein [Johnson, R. A. et ai . , Proc . Nat ' 1 . Acad. 

20 Sci. USA 84:5625-5629 (1987)]. Strains of both group A 
and B circulate simultaneously, but the proportion of 
each may vary unpredictably from year to year. An 
effective therapy must therefore target both subgroups 
of the virus and this is the reason for the selection of 

25 the highly conserved surface fusion (F) protein as 
target antigen for mAb therapy as will be discussed 
later . 

The induction of neutralizing antibodies to RSV 
virus appears to be limited to the F and G surface 
3 0 glycoproteins. Of these two proteins, the F protein is 
the major target for cross-reactive neutralizing 
antibodies associated with protection against different 
strains of RSV virus. In addition, experimental 
vaccination of mice or cotton rats with F protein also 
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results in cross protection. The antigenic relatedness 
of the F protein across strains and subgroups of the 
virus is reflected in its high degree of homology at the 
amino acid level. In contrast, in the two subgroups and 
5 various strains of RSV, antigenic dimorphism was linked 
primarily to the G glycoprotein. The F protein has a 
predicted molecular weight of 68-70 kDa; a signal 
peptide at its N-terminus; a membrane anchor domain at 
its C terminus; and is cleaved proteolytically in the 

10 infected cell prior to virion assembly to yield 

disulfide linked F2 and Fi . Five neutralizing epitopes 
have been identified within the F protein sequence [SEQ 
ID NO: 20] and map to residues 205-225; 259-278; 289- 
299; 483-488 and 417-438. Studies to determine the 

15 frequency of sequence diversion in the F protein [SEQ ID 
NO: 20] showed that the majority of the neutralizing 
epitopes were conserved in all of the 2 3 strains of RSV 
virus isolated in Australia, Europe, and regions of the 
U.S. over a period of thirty years. In another study, 

2 0 seroresponses of forty three infants and young children 
to primary infection with subgroup A or a subgroup B 
strain showed that responses to homologous and 
heterologous F antigens were not significantly 
different, while the G proteins of the subgroup A and B 

2 5 strains were quite unrelated. Moreover, antibody 

inhibition of virus -mediated cell fusion in vitro versus 
inhibition of infection correlates best with protection 
in animal models and fusion inhibition is primarily 
restricted to F protein specific antibodies. 

3 0 Prophylactic treatment for RSV infection is thus 

desirable for the high rislc groups of children as well 
as for all children in underdeveloped countries. 
However, a vaccine for RSV infection is not currently 
available. Severe safety issues surrounding an 
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attenuated whole virus vaccine tested in the 1960s, as 
well as the potential of induced iminunopathology 
associated with the newer candidate subunit vaccines 
make the prospects of a vaccine in the near future 
5 appear remote. To date one drug therapy. Ribavirin, a 
broad spectrum antiviral, has been approved. Ribavirin 
has gained only minimal acceptance owing to problems of 
administration, mild toxicity and questionable efficacy. 
In the majority of cases, hospitalized children receive 

10 no drug therapy and receive only intensive supportive 

care which is extremely costly. It is clear that there 
is a need for a safe, effective and easily administered 
drug for the treatment of RSV infection. 

The use of passive antibody therapy in humans is 

15 well documented and is being used to treat other 
infectious diseases such as hepatitis and 
cytomegalovirus. The feasibility of passive antibody 
treatment /protection against RSV has been well 
established using animal models. Most of the earlier 

20 passive transfer studies in animals against infectious 

agents, including RSV, utilized murine mABs . Studies in 
animals have clearly demonstrated that polyclonal and 
monoclonal antibody against both F and G glycoprotein 
can confer passive protection in RSV virus infection 

25 when given prophylactically or therapeutically [Prince, 
et al . , supra ] . In these studies, passive transfer of 
neutralizing F or G mAbs to mice, cotton rats or 
monkeys, significantly reduce or completely prevent 
replication of the RSV virus in the lungs. However, as 

3 0 discussed above, clearly, the F protein is the more 
important target for antibody therapy. 

Recently, the FDA has approved for use intravenous 
gammaglobulins (IVIG) isolated from pooled human sera. 
Initial reports from this study had been encouraging 
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[Groothuis, J. R. et al . , Antimicrob. Agents Chemo . 
35 (7 ): 1469-1473 (1991)]. However, generic shortcomings 
of IVIGs exist and include, without limitation, the fact 
that such products are human blood derived and grams of 
5 antibody often need to be administered to achieve an 
effective dose. 

Alternatively, monoclonal antibodies have been 
employed. The advantages of such an approach include: a 
higher concentration of specific antibody can be 

10 achieved thereby reducing the amount of globulin 

required to be given; the reliance on direct blood 
products can be eliminated; the levels of antibody in 
the preparation can be more uniformly controlled and the 
routes of administration can be extended. While passive 

15 immunotherapy employing monoclonal antibodies from a 

heterologous species (e.g., murine) has been suggested 
(See: PCT Application PCT/US94 / 08699 , Publication No. WO 
95/04081) , one alternative to reduce the risk of an 
undesirable immune response on the part of the patient 

2 0 directed against the foreign antibody is to employ 
"humanized" antibodies. These antibodies are 
substantially of human origin, with only the 
Complementarity Determining Regions (CDRs) being of non- 
human origin. Particularly useful examples of this 

2 5 approach are disclosed in PCT Application 

PCT/GB91/01554, Publication No. WO 92/04381 and PCT 
Application PCT/GB93 /00725 , Publication No, WO93/20210. 
Clinical trials are on-going to evaluate the efficacy of 
humanized antibodies for treatment of RSV infection in 

3 0 young children. 

A second and more preferred approach is to employ 
fully human mAbs . Unfortunately, there have been few 
successes in producing human monoclonal antibodies 
through classic hybridoma technology. Indeed, 
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acceptable human fusion partners have not been 
identified and murine myeloma fusion partners do not 
work well with human cells, yielding unstable and low 
producing hybridoma lines. However, recent advances in 
5 molecular biology and immunology make it now possible to 
isolate human mABs, particularly directed against 
foreign infectious agents. 

Fully human mAbs to RSV F protein [SEQ ID NO: 20] 
remain a desirable option for the treatment of this 

10 disease. Although some success has been reported in 

obtaining fragments of such mAbs [Barbas, C.F. et al . , 
Proc. Nat^l. Acad. Sci. USA 89:10164-10168 (1992); 
Crowe, J. E. et al . , Proc. Nat ^ 1 . Acad. Sci. USA 91: 
1386-1390 (1994) and PCT application number 

15 PCT/US93/08786, published as WO94/06448, March 31, 
1994)], the achievement of such results is not 
straightforward. Novel human mABs, when and however 
obtained, are particularly useful alone or in 
combination with existing molecules to form 

20 immunotherapeutic compositions. 

There exists a need in the art for useful 
prophylactic compositions for the prevention or passive 
treatment of RSV. 

2 5 Brief Description of the Invention 

In one aspect, this invention provides fully human 
monoclonal antibodies and functional fragments thereof 
specifically reactive with an F protein epitope of RSV 
and capable of neutralizing RSV infection. These human 

3 0 mABs specific for the F protein of RSV virus may be 

useful to passively treat or prevent infection. 

In another aspect, the present invention provides 
modifications to neutralizing single chain Fv fragments 
(scFV) specific for the F protein of RSV produced by 
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random combinatorial cloning of human antibody sequences 
and isolated from a filamentous phage Fab display 
library . 

In still another aspect, there is provided a 
5 reshaped or altered human antibody containing human 
heavy and light chain constant regions from a first 
human donor and heavy and light chain variable regions 
or the CDRs thereof derived from human neutralizing 
monoclonal antibodies for the F protein of RSV derived 

10 from a second human donor. 

In yet another aspect, the present invention 
provides a pharmaceutical composition which contains one 
(or more) altered or reshaped antibodies and a 
pharmaceutically acceptable carrier . 

15 In yet another aspect, the invention provides a 

pharmaceutical composition comprising at least one dose 
of an immunotherapeutically effective amount of the 
reshaped, altered or monoclonal antibody of this 
invention in combination with at least one additional 

20 monoclonal, altered or reshaped antibody. A particular 
embodiment is provided in which the additional antibody 
is an anti-RSV antibody distinguished from the subject 
antibody of the invention by virtue of being reactive 
with a different epitope of the RSV F protein antigen 

2 5 than the subject antibody of the invention. 

In a further aspect, the present invention provides 
a method for passive immunotherapy of RSV disease in a 
human by administering to said human an effective amount 
of the phanuaceutical composition of the invention for 

3 0 the prophylactic or therapeutic treatment of RSV 

infection . 

In yet another aspect, the present invention 
provides methods for, and components useful in, the 
recombinant production of human and altered antibodies 
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(e.g., engineered antibodies, CDRs , Fab or F(ab)2 
fragments, or analogs thereof) which are derived from 
human neutralizing monoclonal antibodies (mAbs) for the 
F protein of RSV. These components include isolated 
5 nucleic acid sequences encoding same, recombinant 

plasmids containing the nucleic acid sequences under the 
control of selected regulatory sequences which are 
capable of directing the expression thereof in host 
cells (preferably mammalian) transfected with the 

10 recombinant plasmids. The production method involves 
culturing a transfected host cell line of the present 
invention under conditions such that the human or 
altered antibody is expressed in said cells and 
isolating the expressed product therefrom. 

15 In still another aspect of the invention is a 

method to diagnose the presence of RSV in a human which 
comprises contacting a sample of biological fluid with 
the human antibodies and altered antibodies and 
fragments thereof of the instant invention and assaying 

20 for the occurrence of binding between said human 

antibody (or altered antibody, or fragment) and RSV. 

Other aspects and advantages of the present 
invention are described further in the detailed 
description and the preferred embodiments thereof . 

25 

Brief Description of the Drawings 

Fig. lA is a graph illustrating the competition of 

GX-1 scFV phage binding with RSV19 mAb [International 
patent publication No. WO92/043 81, published March 19, 
30 1992] . 

Fig. IB is a graph illustrating the competition of 

GX-1 scFV phage binding with RSV B4 mAb [International 
patent publication No. WO93/20210, published October 14, 
1993] . 

1 i 
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Fig. 2 is a graph illustrating virus neutralization 

by scFV phages, GX-1 , GX.-3 , and Gr-I with RSV strain 273. 

Fig. 3 illustrates the DNA sequence [SEQ ID NO : 1] 
and protein sequence (amino acids reported in single 

5 letter code) [SEQ ID NO: 2] for the GA,-1 light chain 

variable region, processed N-terminus through framework 
IV. 

Fig. 4 illustrates the DNA sequence [SEQ ID NO: 3] 
and protein sequence (amino acids reported in single 

10 letter code) [SEQ ID NO : 4] for the gA,-1 heavy chain 

variable region, processed N-terminus through framework 
IV. 

Fig. 5 illustrates the cloning strategy used for 

the construction of the gX,-1 monoclonal antibody. The 
15 heavy chain V region was cloned into the pCD derivative 
vector as a Xhol - Apa.X fragment. The entire light 
chain V region was cloned into the pCN derivative 
vector, 43-lpcn, as a SacX - Avrll fragment. Details 
are described below. 
2 0 Fig. 6 provides a comparison of the heavy chain 

amino acid sequences of the G^-1 single chain Fv [SEQ ID 
NO: 5] and various monoclonal antibodies of this 
invention. The amino acid sequences of the heavy chains 
for the A [SEQ ID NO : 7] and B [SEQ ID NO : 8] constructs 

25 are shown. Numbering of the residues is based on the 

germline (GL) gene Dp58 [SEQ ID NO: 6] , beginning at the 
mature processed amino terminus and ending at CDR3 . The 
"-" indicates identity to the preceding sequence (eg., A 
compared to B) . Bold residues correspond to the leader 

30 region, and to CDRs 1-3. 

Fig. 7 provides a comparison of the light chain 

amino acid sequences of the GA.-1A single chain Fv [SEQ ID 
NO: 9] and various monoclonal antibodies of this 
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invention. The amino acid sequences of the light chains 

for the A [SEQ ID NO: 11] and B [SEQ ID NO: 12] 
constructs are shown. Numbering of the residues in the 

Vk region is based on the germline (GL) gene DpL8 [SEQ 
5 ID NO: 10], beginning at the mature processed amino 

terminus and ending at CDR3 . For reference to f rameworJc 

4, the actual numbering is also shown for GA,-1A. As in 
Fig. 6, the indicates identity to the preceding 

sequence . 

10 Figs. 8A to 8F illustrate the continuous DNA 

sequence [SEQ ID NO: 13] of the expression plasmid gA,- 

lApcd containing the RSV neutralizing human G?l-1 mAb for 
the heavy chain. The start of translation, leader 
peptide, amino- terminal processing site, carboxy 

15 terminus of the GX-1 heavy chain, and Eco RI restriction 
endonuclease cleavage site are shown. 

Figs. 9A to 9E illustrate the continuous DNA 

sequence [SEQ ID NO: 14] of the expression plasmid gX- 

lApcn containing the RSV neutralizing human mAb for 

2 0 the light chain. The corresponding features for the 

light chain as for Figs. 8A~8F are shown. 

Figs. lOA and lOB illustrate the continuous DNA 
sequence [SEQ ID NO: 15] of the coding region of the 

heavy chain of plasmid G?l-lBpcd. Bolded residues 
25 indicate differences from the full vector sequence for 

GA,-lApcd in Figs. 8A-8F [SEQ ID NO: 13]. 

Fig. 11 is the DNA sequence [SEQ ID NO: 16] of the 

coding region for the light chain of plasmid GA--lBpcn. 
Bolded residues indicate differences from the full 

3 0 vector sequence for G?i-lApcn in Figs. 9A-9E [SEQ ID NO: 

14] . 
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Detailed Description of the Invention 

This invention provides useful human monoclonal 
antibodies (and fragments thereof) reactive with the F 
protein of RSV, isolated nucleic acids encoding same and 
5 various means for their recombinant production as well 

as therapeutic, prophylactic and diagnostic uses of such 
antibodies and fragments thereof. 

I. Definitions . 

As used in this specification and the claims, the 

10 following terms are defined as follows : 

"Altered antibody" refers to a protein encoded by 
an altered immunoglobulin coding region, which may be 
obtained by expression in a selected host cell. Such 
altered antibodies are engineered antibodies (e.g., 

15 chimeric, humanized, or reshaped or immunologically 

edited human antibodies) or fragments thereof lacking 
all or part of an immunoglobulin constant region, e.g., 
Fv, Fab, or F{ab')2 and the like. 

"Altered immunoglobulin coding region" refers to a 

20 nucleic acid sequence encoding an altered antibody of 
the invention or a fragment thereof. 

"Reshaped human antibody" refers to an altered 
antibody in which minimally at least one CDR from a 
first human monoclonal donor antibody is substituted for 

25 a CDR in a second human acceptor antibody. Preferrably 
all six CDRs are replaced. More preferrably an entire 
antigen combining region (e.g., Fv, Fab or F(ab')2 ) from 
a first human donor monoclonal antibody is substituted 
for the corresponding region in a second human acceptor 

3 0 monoclonal antibody. Most preferrably the Fab region 
from a first human donor is operatively linked to the 
appropriate constant regions of a second human acceptor 
antibody to form a full length monoclonal antibody. 
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"First immunoglobulin partner" refers to a nucleic 

acid sequence encoding a human framework or human 
immunoglobulin variable region in which the native (or 
naturally-occurring) CDR-encoding regions are replaced 
5 by the CDR-encoding regions of a donor human antibody. 

The human variable region can be an immunoglobulin heavy 
chain, a light chain (or both chains) , an analog or 
functional fragments thereof. Such CDR regions, located 
within the variable region of antibodies 

10 (immunoglobulins) can be determined by known methods in 
the art. For example, Kabat et a.1 . ( Sequences of 
Proteins of Immunological Interest , 4th Ed., U.S. 
Department of Health and Human Services, National 
Institutes of Health (1987)) disclose rules for locating 

15 CDRs . In addition, computer programs are known which 
are useful for identifying CDR regions/structures. 

"Second fusion partner" refers to another 
nucleotide sequence encoding a protein or peptide to 
which the first immunoglobulin partner is fused in frame 

2 0 or by means of an optional conventional linker sequence 
(i.e., operatively linked). Preferably the fusion 
partner is an immunoglobulin gene and when so, it is 
referred to as a "second immunoglobulin partner" . The 
second immunoglobulin partner may include a nucleic acid 

2 5 sequence encoding the entire constant region for the 

same (i.e., homologous - the first and second altered 
antibodies are derived from the same source) or an 
additional (i.e., heterologous) antibody of interest. 
It may be an immunoglobulin heavy chain or light chain 

3 0 (or both chains as part of a single polypeptide) . The 

second immunoglobulin partner is not limited to a 
particular immunoglobulin class or isotype. In 
addition, the second immunoglobulin partner may comprise 
part of an immunoglobulin constant region, such as found 
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in a Fab, or F(ab)2 {i.e., a discrete part of an 
appropriate human constant region or framework region) . 

A second fusion partner may also comprise a sequence 
encoding an integral membrane protein exposed on the 
5 outer surface of a host cell, e.g., as part of a phage 
display library, or a sequence encoding a protein for 
analytical or diagnostic detection, e.g., horseradish 

peroxidase (HRP) , jj-galactosidase , etc. 

The terms Fv, Fc, Fd, Fab, or F(ab')2 are used with 

10 their standard meanings [see, e.g., Harlow et al . , 
Antibodies A Laboratory Manual , Cold Spring Harbor 
Laboratory, (1988) ] . 

As used herein, an "engineered antibody" describes 
a type of altered antibody, i.e., a full-length 

15 synthetic antibody (e.g., a chimeric, humanized, 

reshaped or immunologically edited human antibody as 
opposed to an antibody fragment) in which a portion of 
the light and/ or heavy chain variable domains of a 
selected acceptor antibody are replaced by analogous 

2 0 parts from one or more donor antibodies which have 

specificity for the selected epitope. For example, such 
molecules may include antibodies characterized by a 
humanized heavy chain associated with an unmodified 
light chain (or chimeric light chain), or vice versa. 
25 Engineered antibodies may also be characterized by 

alteration of the nucleic acid sequences encoding the 
acceptor antibody light and/or heavy variable domain 
framework regions in order to retain donor antibody 
binding specificity. These antibodies can comprise 

3 0 replacement of one or more CDRs (preferably all) from 

the acceptor antibody with CDRs from a donor antibody 
described herein. 

A "chimeric antibody" refers to a type of 
engineered antibody which contains naturally-occurring 
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variable region (light chain and heavy chains) derived 
from a donor antibody in association with light and 
heavy chain constant regions derived from an acceptor 
antibody from a heterologous species . 
5 A "humanized antibody" refers to a type of 

engineered antibody having its CDRs derived from a non- 
human donor immunoglobulin, the remaining 
immunoglobulin-derived parts of the molecule being 
derived from one (or more) human immunoglobulin ( s ) . In 

10 addition, framework support residues may be altered to 
preserve binding affinity [see, e.g.. Queen et ai . , 
Proc. Nat^l. Acad. Sci. USA , _86: 10029-10032 (1989), 
Hodgson et al . , Bio/Technology, 9 : 421 (1991)]. 

An "immunologically edited antibody" refers to a 

15 type of engineered antibody in which changes are made in 
donor and/or acceptor sequences to edit regions in 
respect of cloning artifacts, germ line enhancements, 
etc. aimed at reducing the likelihood of an 
immunological response to the antibody on the part of a 

2 0 patient being treated with the edited antibody. 

The term "donor antibody" refers to an antibody 
(monoclonal, or recombinant) which contributes the 
nucleic acid sequences of its variable regions, CDRs, or 
other functional fragments or analogs thereof to a first 
25 immunoglobulin partner, so as to provide the altered 
immunoglobulin coding region and resulting expressed 
altered antibody with the antigenic specificity and 
neutralizing activity characteristic of the donor 
antibody. One donor antibody suitable for use in this 

3 0 invention is a Fab fragment of a human neutralizing 

monoclonal antibody designated as Fab gX-1 . Fab GA.-1 is 
defined as a having the variable light and heavy chain 

DNA and amino acid sequences GX-1 as shown in Figs. 3, 
4, 8A-8F and 9A-9E [SEQ ID NOS : 1-4, 13 and 14]. 
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The term "acceptor antibody" refers to an antibody 
(monoclonal or recombinant) from a source genetically 
unrelated to the donor antibody, which contributes all 
(or any portion, but preferably all) of the nucleic acid 
5 sequences encoding its heavy and/or light chain 

framework regions and/or its heavy and/or light chain 
constant regions to the first immunoglobulin partner. 
Preferably a human antibody is the acceptor antibody. 
"CDRs" are defined as the complementarity 
10 determining region amino acid sequences of an antibody 
which are the hypervariable regions of immunoglobulin 
heavy and light chains [see, e.g., Kabat et al . , 
Sequences of Proteins of Immunological Interest , 4th 
Ed., U.S. Department of Health and Human Services, 
15 National Institutes of Health (1987)]. There are three 
heavy chain and three light chain CDRs (or CDR regions) 
in the variable portion of an immunoglobulin. Thus, 
"CDRs" as used herein refers to all three heavy chain 
CDRs, or all three light chain CDRs (or both all heavy 

2 0 and all light chain CDRs, if appropriate) . CDRs provide 

the majority of contact residues for the binding of the 
antibody to the antigen or epitope. CDRs of interest in 
this invention are derived from donor antibody variable 
heavy and light chain sequences, and include analogs of 
25 the naturally occurring CDRs, which analogs also share 
or retain the same antigen binding specificity and/or 
neutralizing ability as the donor antibody from which 
they were derived. 

By "sharing the antigen binding specificity or 

3 0 neutralizing ability" is meant, for example, that 

although Fab GA--1 may be characterized by a certain 
level of antigen affinity, a CDR encoded by a nucleic 
acid sequence of Fab gX,-1 in an appropriate structural 
environment may have a lower, or higher affinity. It is 
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expected that CDRs of Fab GX~1 in such environments will 
nevertheless recognize the same epitope (s) as does the 

intact Fab gA,-1. A "functional fragment" is a partial 
heavy or light chain variable sequence (e.g., minor 
5 deletions at the amino or carboxy terminus of the 

immunoglobulin variable region) which retains the same 
antigen binding specificity and/or neutralizing ability 
as the antibody from which the fragment was derived. 

An "analog" is an amino acid sequence modified by 

10 at least one amino acid, wherein said modification can 
be a chemical modification, or a substitution or a 
rearrangement of a few amino acids (i.e., no more than 
10), which modification permits the amino acid sequence 
to retain the biological characteristics, e.g., antigen 

15 specificity and high affinity, of the unmodified 
sequence. For example, (silent) mutations can be 
constructed, via substitutions, when certain 
endonuclease restriction sites are created within or 
surrounding CDR- encoding regions . 

20 Analogs may also arise as allelic variations. An 

"allelic variation or modification" is an alteration in 
the nucleic acid sequence encoding the amino acid or 
peptide sequences of the invention. Such variations or 
modifications may be due to degeneracy in the genetic 

25 code or may be deliberately engineered to provide 
desired characteristics. These variations or 
modifications may or may not result in alterations in 
any encoded amino acid sequence. 

The term "effector agents" refers to non-protein 

3 0 carrier molecules to which the altered antibodies, 

and/or natural or synthetic light or heavy chains of the 
donor antibody or other fragments of the donor antibody 
may be associated by conventional means. Such non- 
protein carriers can include conventional carriers used 
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in the diagnostic field, e.g., polystyrene or other 
plastic beads, polysaccharides, e.g., as used in the 
BIAcore (Pharmacia) system, or other non-protein 
substances useful in the medical field and safe for 
5 administration to humans and animals. Other effector 
agents may include a macrocycle, for chelating a heavy 
metal atom, or radioisotopes. Such effector agents may 
also be useful to increase the half-life of the altered 
antibodies, e.g., polyethylene glycol. 

10 II. Combinatorial Cloning. 

As mentioned above, a number of problems have 
hampered the direct application of the hybridoma 
technology [G. Kohler and C. Milstein, Nature , 256: 495- 
497 (1975)] to the generation and isolation of human 

15 monoclonal antibodies . Among these are a lack of 

suitable fusion partner myeloma cell lines used to form 
hybridoma cell lines as well as the poor stability of 
such hybridomas even when formed. These shortcomings 
are further exacerbated in the case of RSV because of 

2 0 the paucity of viral specific B cells in the peripheral 

circulation. Therefore, the molecular biological 
approach of combinatorial cloning is preferred. 

Combinatorial cloning is disclosed generally in PCT 
Publication No. WO90/14430. Simply stated, the goal of 
25 combinatorial cloning is to transfer to a population of 
bacterial cells the immunological genetic capacity of a 
human cell, tissue or organ. It is preferred to employ 
cells, tissues or organs which are immunocompetent. 
Particularly useful sources include, without limitation, 

3 0 spleen, thymus, lymph nodes, bone marrow, tonsil and 

peripheral blood lymphocytes. The cells may be 
optionally RSV stimulated in vitro, or selected from 
donors which are known to have produced an immune 
response or donors who are HIV^ but asymptomatic . 



20 



wo 00/69462 PCT/USOO/ 13694 

The genetic information isolated from the donor 
cells can be in the form of DNA or RNA and is 
conveniently amplified by Polymerase Chain Reaction 
(PGR) or similar techniques. When isolated as RNA the 
5 genetic information is preferably converted into cDNA by 
reverse transcription prior to amplification. The 
amplification can be generalized or more specifically 
tailored. For example, by a careful selection of PGR 
primer sequences, selective amplification of 

10 immunoglobulin genes or subsets within that class of 
genes can be achieved. 

Once the component gene sequences are obtained, in 
this case the genes encoding the variable regions of the 
various heavy and light antibody chains, the light and 

15 heavy chain genes are associated in random combinations 
to form a random combinatorial library. Various 
recombinant DNA vector systems have been described to 
facilitate combinatorial cloning [see: PGT Publication 
No. WO90/14430 supra; Scott and Smith, Science 249:386- 

20 406 (1990); or U. S. Patent 5,223,409]. Having 

generated the combinatorial library, the products can, 
after expression, be conveniently screened by biopanning 
with RSV F protein or, if necessary, by epitope blocked 
biopanning as described in more detail below. 

25 As described herein, it is preferred to use single 

chain antibodies for combinatorial cloning and screening 
and then to convert them to full length mAbs after 
selection of the desired candidate molecules. However, 
Fab fragments of mAbs can also be used for cloning and 

3 0 screening. 

III. Antibody Fragments. 

The present invention contemplates the use of scFv, 
Fab, or F(ab')2 fragments to derived full-length mAbs 
directed against the F protein of RSV. Although these 
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fragments may be independently useful as protective and 

therapeutic agents in vivo against RSV-mediated 
conditions or in vitro as part of an RSV diagnostic, 
they are employed herein as a component of a reshaped 
5 human antibody. A scFv fragment contains the light and 
heavy chain variable regions joined by a linker of about 
12 amino acids in either a light-linker-heavy or a 
heavy-linker-light orientation. A Fab fragment contains 
the entire light chain and amino terminal portion of the 

10 heavy chain; and a F{ab')2 fragment is the fragment 
formed by two Fab fragments bound by additional 
disulfide bonds. RSV binding monoclonal antibodies 
provide sources of scFv or Fab fragments which can be 
obtained from a combinatorial phage library [see, e.g., 

15 Winter et al . , Ann . Rev . Immunol . , 12:433-455 (1994) or 
Barbas et al . , Proc . Nat ^ 1 . Acad. Sci . (USA) 89, 10164- 
10168 (1992), which are both hereby incorporated by 
reference in their entireties] . 

IV. Anti-RSV Antihody Amino Acid and Nucleotide 

2 0 Seqziences of Interest . 

The Fab GX-1 or other antibodies described herein 
may contribute sequences, such as variable heavy and/or 
light chain peptide sequences, framework sequences, CDR 
sequences, functional fragments, and analogs thereof, 
25 and the nucleic acid sequences encoding them, useful in 
designing and obtaining various altered antibodies which 
are characterized by the antigen binding specificity of 
the donor antibody. 

As one example, the present invention thus provides 

3 0 variable light chain and variable heavy chain sequences 

from the RSV human Fab G^-IA and sequences derived 

therefrom. The heavy chain variable region of Fab gA,~1A 
is illustrated by Figs. 4, 8A-8F and lOA-lOB [SEQ ID 
NOS: 3-4, 13 and 15] . 
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The nucleic acid sequences of this invention, or 
fragments thereof, encoding the variable light chain and 
heavy chain peptide sequences are also useful for 
mutagenic introduction of specific changes within the 
5 nucleic acid sequences encoding the CDRs or framework 

regions, and for incorporation of the resulting modified 
or fusion nucleic acid sequence into a plasmid for 
expression. For example, silent substitutions in the 
nucleotide sequence of the framework and CDR-encoding 

10 regions can be used to create restriction enzyme sites 
which would facilitate insertion of mutagenized CDR 
(and/or framework) regions. These CDR-encoding regions 
may be used in the construction of reshaped human 
antibodies of this invention. 

15 Taking into account the degeneracy of the genetic 

code, various coding sequences may be constructed which 
encode the variable heavy and light chain amino acid 
sequences, and CDR sequences of the invention as well as 
functional fragments and analogs thereof which share the 

2 0 antigen specificity of the donor antibody. The isolated 
nucleic acid sequences of this invention, or fragments 
thereof, encoding the variable chain peptide sequences 
or CDRs can be used to produce altered antibodies, e.g., 
chimeric or humanized antibodies, or other engineered 

2 5 antibodies of this invention when operatively combined 

with a second immunoglobulin partner. 

It should be noted that in addition to isolated 
nucleic acid sequences encoding portions of the altered 
antibody and antibodies described herein, other such 

3 0 nucleic acid sequences are encompassed by the present 

invention, such as those complementary to the native 
CDR-encoding sequences or complementary to the human 
framework regions surrounding the CDR-encoding regions. 
Such sequences include all nucleic acid sequences which 
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by virtue of the redundancy of the genetic code are 
capable of encoding the same amino acid sequence as 
given in Figs. 3 and 4 [SEQ ID NOS : 2 and 4] . Figs. 6 
and 7 [SEQ ID NOS: 5-12] provide representations of such 
5 sequences. Other useful DNA sequences encompassed by 
this invention include those sequences which hybridize 
under stringent hybridization conditions [See: T. 
Maniatis et al . , Molecular Cloning (A Laboratory 
Manual) , Cold Spring Harbor Laboratory (1982), pages 387 

10 to 3 89] to the DNA sequences encoding the GX-1 

antibodies (e.g., sequences of Figs. 3, 4, 8A-8F through 
11 [SEQ ID NOS: 1-4, 13-16]) and which retain the 
antigen binding properties of those antibodies . An 
example of one such stringent hybridization condition is 

15 hybridization at 4XSSC at SS^'C, followed by a washing in 

O.IXSSC at 65°C for an hour. Alternatively an exemplary 
stringent hybridization condition is in 50% formamide, 

4XSSC at 42°C. Preferably, these hybridizing DNA 
sequences are at least about 18 nucleotides in length, 

20 i.e., about the size of a CDR. 

V. Altered Immunoglohulin Coding Regions and 
Altered Antibodies . 

Altered immunoglobulin coding regions encode 
altered antibodies which include engineered antibodies 

25 such as chimeric antibodies, humanized, reshaped, and 
immunologically edited human antibodies. A desired 
altered immunoglobulin coding region contains CDR- 
encoding regions in the form of scFv regions that encode 
peptides having the antigen specificity of an RSV 

3 0 antibody, preferably a high affinity antibody such as 
provided by the present invention, inserted into an 
acceptor immunoglobulin partner. 

When the acceptor is an immunoglobulin partner, as 
defined above, it includes a sequence encoding a second 

24 
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antibody region of interest, for example, an Fc region. 
Immunoglobulin partners may also include sequences 
encoding another immunoglobulin to which the light or 
heavy chain constant region is fused in frame or by 
5 means of a linker sequence. Engineered antibodies 

directed against functional fragments or analogs of RSV 
may be designed to elicit enhanced binding with the same 
antibody , 

The immunoglobulin partner may also be associated 

10 with effector agents as defined above, including non- 
protein carrier molecules, to which the immunoglobulin 
partner may be operatively linked by conventional means. 

Fusion or linkage between the immunoglobulin 
partners, e.g., antibody sequences, and the effector 

15 agent may be by any suitable means, e.g., by 

conventional covalent or ionic bonds, protein fusions, 
or hetero-bif unctional cross-linkers, e.g., 
carbodiimide, glutaraldehyde , and the like. Such 
techniques are known in the art and readily described in 

20 conventional chemistry and biochemistry texts. 

Additionally, conventional linker sequences which 
simply provide for a desired amount of space between the 
second immunoglobulin partner and the effector agent may 
also be constructed into the altered immunoglobulin 

25 coding region. The design of such linkers is well known 
to those of skill in the art. 

In addition, signal sequences for the molecules of 
the invention may be modified to enhance expression. 
For example the reshaped human antibody having the 

3 0 signal sequence and CDRs derived from the Fab GX.-1 heavy 
chain sequence, may have the original signal peptide 
replaced with another signal sequence such as the 
Campath leader sequence [Page, M. J. et al . , 
BioTechnology 9:64-68(1991)]. 
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An exemplary altered antibody, a reshaped human 
antibody, contains a variable heavy and the entire light 
chain peptide or protein sequence having the antigen 

specificity of Fab gX-I, fused to the constant heavy 
5 regions Ch-i-Ch-3 derived from a second human antibody. 

In still a further embodiment, the engineered 
antibody of the invention may have attached to it an 
additional agent. For example, the procedure of 
recombinant DNA technology may be used to produce an 

10 engineered antibody of the invention in which the Fc 
fragment or Ch-2Ch-3 domain of a complete antibody 
molecule has been replaced by an enzyme or other 
detectable molecule (i.e., a polypeptide effector or 
reporter molecule) . 

15 Another desirable protein of this invention may 

comprise a complete antibody molecule, having full 
length heavy and light chains, or any discrete fragment 
thereof, such as the Fab or F(ab')2 fragments, a heavy 
chain dimer, or any minimal recombinant fragments 

2 0 thereof such as an Fv or a single-chain antibody (SCA) or 
any other molecule with the same specificity as the 

selected donor Fab GX,-1. Such protein may be used in 
the form of an altered antibody, or may be used in its 
unfused form. 

2 5 Whenever the immunoglobulin partner is derived from 

an antibody different from the donor antibody, e.g., any 
isotype or class of immunoglobulin framework or constant 
regions, an engineered antibody results. Engineered 
antibodies can comprise immunoglobulin (Ig) constant 

3 0 regions and variable framework regions from one source, 

e.g., the acceptor antibody, and one or more (preferably 
all) CDRs from the donor antibody, e.g., the anti-RSV 
antibody described herein. In addition, alterations, 
e.g., deletions, substitutions, or additions, of the 
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acceptor mAb light and/or heavy variable domain 
framework region at the nucleic acid or amino acid 
levels, or the donor CDR regions may be made in order to 
retain donor antibody antigen binding specificity or to 
5 reduce potential immunogenicity . 

Such engineered antibodies are designed to employ 
one (or both) of the variable heavy and/or light chains 
of the RSV mAb (optionally modified as described) or one 
or more of the below-identified heavy or light chain 

10 CDRs . The engineered antibodies of the invention are 

neutralizing, i.e., they desirably inhibit virus growth 
in vitro and in vivo in animal models of RSV infection. 

Such engineered antibodies may include a reshaped 
human antibody containing the human heavy and light 

15 chain constant regions fused to the RSV antibody 
functional fragments. A suitable human (or other 
animal) acceptor antibody may be one selected from a 
conventional database, e.g., the KABAT® database, Los 
Alamos database, and Swiss Protein database, by homology 

2 0 to the nucleotide and amino acid sequences of the donor 

antibody. A human antibody characterized by a homology 
to the framework regions of the donor antibody (on an 
amino acid basis) may be suitable to provide a heavy 
chain constant region and/or a heavy chain variable 
25 framework region for insertion of the donor CDRs. A 
suitable acceptor antibody capable of donating light 
chain constant or variable framework regions may be 
selected in a similar manner. It should be noted that 
the acceptor antibody heavy and light chains are not 

3 0 required to originate from the same acceptor antibody. 

Desirably the heterologous framework and constant 
regions are selected from human immunoglobulin classes 
and isotypes, such as IgG (subtypes 1 through 4) , IgM, 
IgA and IgE. The Fc domains are not limited to native 



27 



wo 00/69462 PCT/USOO/13694 

sequences, but include mutant variants known in the art 
that alter function. For example, mutations have been 
described in the Fc domains of certain IgG antibodies 
that reduce Fc-mediated complement and Fc receptor 
5 binding [see, e.g., A. R. Duncan at al . , Nature , 

332:563-554 (1988); A. R. Duncan and G. Winter, Nature , 
332:738-740 (1988); M.-L. Alegre et al . , J. Immunol . , 
148:3461-3468 (1992); M.-H. Tao et al . , J . Exp , Med , , 
178:661-667 (1993); and V. Xu et ai . J. Biol. Chem , , 

10 269:3469-2374 (1994)]; alter clearance rate [J.-K. Kim 

et al., Eur. J. Immunol ., 24:542-548 (1994)]; and reduce 
structural heterogeneity [S. Angal et al . , Mol . Immunol . 
30:105-108 (1993)]. Also, other modifications are 
possible such as oligomerization of the antibody by 

15 addition of the tailpiece segment of IgM and other 
mutations [R. I. F. Smith and S. L. Morrison, 
Biotechnology 12:683-688 (1994); R. I. F. Smith et al . , 
J. Immunol ., 154: 2226-2236 (1995)] or addition of the 
tailpiece segment of IgA [I. Kariv et al . , J . Immunol . , 

20 157: 29-38 (1996)]. However, the acceptor antibody need 
not comprise only human immunoglobulin protein 
sequences. For instance a gene may be constructed in 
which a DNA sequence encoding part of a human 
immunoglobulin chain is fused to a DNA sequence encoding 

25 a non- immunoglobulin amino acid sequence such as a 
polypeptide effector or reporter molecule. 

The altered antibody thus preferably has the 
structure of a natural human antibody or a fragment 
thereof, and possesses the combination of properties 

30 required for effective therapeutic use, e.g., treatment 
of RSV mediated diseases in man, or for diagnostic uses. 

It will be understood by those sJcilled in the art 
that an altered antibody may be further modified by 
changes in variable domain amino acids without 
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necessarily affecting the specificity and high affinity 
of the donor antibody (i.e., an analog). It is 
anticipated that heavy and light chain amino acids may 
be substituted by other amino acids either in the 
5 variable domain frameworks or CDRs or both. 

Particularly preferred is the immunological editing of 
such reconstructed sequences as illustrated in the 
examples herein. 

In addition, the variable or constant region may be 
10 altered to enhance or decrease selective properties of 
the molecules of the instant invention, as described 
above. For example, dimerization , binding to Fc 
receptors, or the ability to bind and activate 
complement [see, e.g., Angal et al . , Mol . Immunol , 
15 3_0:105-108 (1993); Xu et ai . , J. Biol. Chem , 269:3469- 
3474 (1994); and Winter et al . , EP 307,434-B]. 

Such antibodies are useful in the prevention and 
treatment of RSV mediated disorders, as discussed below. 

VI, Production of Altered antibodies and 

2 0 Engineered Antibodies. 

The resulting reshaped human antibodies of this 
invention can be expressed in recombinant host cells, 
e.g., COS, CHO or myeloma cells. A conventional 
expression vector or recombinant plasmid is produced by 
25 placing these coding sequences for the altered antibody 
in operative association with conventional regulatory 
control sequences capable of controlling the replication 
and expression in, and/or secretion from, a host cell. 
Regulatory sequences include promoter sequences, e.g., 

3 0 CMV promoter, and signal sequences, which can be derived 

from other known antibodies. Similarly, a second 
expression vector can be produced having a DNA sequence 
which encodes a complementary antibody light or heavy 
chain. Preferably this second expression vector is 
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identical to the first except insofar as the coding 
sequences and selectable markers are concerned. This 
ensures as far as possible that each polypeptide chain 
is functionally expressed. Alternatively, the heavy and 
5 light chain coding sequences for the altered antibody 
may reside on a single vector. 

A selected host cell is co- transf ected by 
conventional techniques with both the first and second 
vectors (or simply transfected by a single vector) to 

10 create the transfected host cell of the invention 

comprising both the recombinant or synthetic light and 
heavy chains. The transfected cell is then cultured by 
conventional techniques to produce the engineered 
antibody of the invention. The production of the 

15 antibody which includes the association of both the 

recombinant heavy chain and light chain is measured in 
the culture by an appropriate assay, such as an enzyme- 
linked immunosorbent assay (ELISA) or radioimmunoassay 
(RIA) . Similar conventional techniques may be employed 

2 0 to construct other altered antibodies and molecules of 

this invention. 

Suitable vectors for the cloning and subcloning 
steps employed in the methods and construction of the 
compositions of this invention may be selected by one of 

2 5 skill in the art. For example, the conventional pUC 

series of cloning vectors, may be used. One vector used 
is pUC19, which is commercially available from supply 
houses, such as Amersham (Buckinghamshire, United 
Kingdom) or Pharmacia (Uppsala, Sweden) . Any vector, 

3 0 which is capable of replicating readily, has an 

abundance of cloning sites and selectable genes (e.g., 
antibiotic resistance) , and is easily manipulated, may 
be used for cloning. Thus, the selection of the cloning 
vector is not a limiting factor in this invention. 
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Similarly, the vectors employed for expression of 
the engineered antibodies according to this invention 
may be selected by one of skill in the art from any 
conventional vectors. Preferred vectors include for 
5 example plasmids pCD or pCN. The vectors also contain 
selected regulatory sequences (such as CMV promoters) 
which direct the replication and expression of 
heterologous DNA sequences in selected host cells . 
These vectors contain the above described DNA sequences 

10 which code for the engineered antibody or altered 

immunoglobulin coding region. In addition, the vectors 
may incorporate the selected immunoglobulin sequences 
modified by the insertion of desirable restriction sites 
for ready manipulation. 

15 The expression vectors may also be characterized by 

genes suitable for amplifying expression of the 
heterologous DNA sequences, e.g., the mammalian 
dihydrof olate reductase gene (DHFR) . Other preferable 
vector sequences include a polyadenylation (polyA) 

2 0 signal sequence, such as from bovine growth hormone 

(BGH) and the betaglobin promoter sequence (betaglopro) . 
The expression vectors useful herein may be synthesized 
by techniques well known to those skilled in this art. 
The components of such vectors, e.g. replicons, 
25 selection genes, enhancers, promoters, signal sequences 
and the like, may be obtained from commercial or natural 
sources or synthesized by known procedures for use in 
directing the expression and/or secretion of the product 
of the recombinant DNA in a selected host. Other 

3 0 appropriate expression vectors of which numerous types 

are known in the art for mammalian, bacterial, insect, 
yeast, and fungal expression may also be selected for 
this purpose. 
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The present invention also encompasses a cell line 
transfected with a recombinant plasmid containing the 
coding sequences of the engineered antibodies or altered 
immunoglobulin molecules thereof. Host cells useful for 
5 the cloning and other manipulations of these cloning 

vectors are also conventional. However, most desirably, 
cells from various strains of E. coll are used for 
replication of the cloning vectors and other steps in 
the construction of altered antibodies of this 
10 invention. 

Suitable host cells or cell lines for the 
expression of the engineered antibody or altered 
antibody of the invention are preferably mammalian cells 
such as CHO, COS, a fibroblast cell (e.g., 3T3), and 
15 myeloid cells, and more preferably a CHO or a myeloid 
cell. Human cells may be used, thus enabling the 
molecule to be modified with human glycosylation 
patterns. Alternatively, other eukaryotic cell lines 
may be employed. The selection of suitable mammalian 

2 0 host cells and methods for transformation, culture, 

amplification, screening and product production and 
purification are known in the art. See, e.g., Sambrook 
et al . , Molecular Cloning (A Laboratory Manual) , 2nd 
edit., Cold Spring Harbor Laboratory (1989) . 
25 Bacterial cells may prove useful as host cells 

suitable for the expression of the recombinant scFvs, 
Fabs and MAbs of the present invention [see, e.g., 
Pluckthun, A., Immunol. Rev. , 130 : 151-188 (1992)]. The 
tendency of proteins expressed in bacterial cells to be 

3 0 in an unfolded or improperly folded form or in a non- 

glycosylated form does not pose as great a concern 
because Fabs are not normally glycosylated and can be 
engineered for exported expression, thereby reducing the 
high concentration that facilitates misfolding. 
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Nevertheless, any recombinant Fab produced in a 

bacterial cell would be screened for retention of 

antigen binding ability. If the molecule expressed by 

the bacterial cell was produced and exported in a 

5 properly folded form, that bacterial cell would be a 

desirable host. For example, various strains of E. coli 

used for expression are well-known as host cells in the 

field of biotechnology. Various strains of B, subtilis, 

Streptomyces , other bacilli and the like may also be 

10 employed in this method. 

Where desired, strains of yeast cells known to 
those skilled in the art are also available as host 
cells, as well as insect cells, e.g. Drosophila and 
LepidoptGra. and viral expression systems [see, e.g. 

15 Miller et ai . , Genetic Engineering , 8:277-298, Plenum 
Press (1986) and references cited therein] . 

The general methods by which the vectors of the 
invention may be constructed, the transfection methods 
required to produce the host cells of the invention, and 

20 culture methods necessary to produce the altered 

antibody of the invention from such host cell are all 
conventional techniques. Likewise, once produced, the 
altered antibodies of the invention may be purified from 
the cell culture contents according to standard 

25 procedures of the art, including ammonium sulfate 

precipitation, affinity columns, column chromatography, 
gel electrophoresis and the like. Such techniques are 
within the skill of the art and do not limit this 
invention . 

30 Yet another method of expression of reshaped 

antibodies may utilize expression in a transgenic 
animal. An exemplary systems is described in U. S. 
Patent No. 4,873,316. The expression system described 
in that reference uses the animal's casein promoter and. 
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when transgenically incorporated into a mammal, permits 

the female to produce the desired recombinant protein in 
its milk. 

Once expressed by the desired method, the 
5 engineered antibody is then examined for in vitro 

activity by use of an appropriate assay. At present, 
conventional ELISA assay formats are employed to assess 
qualitative and quantitative binding of the altered 
antibody to RSV. Additionally, other in vitro assays 
10 and in vivo animal models may also be used to verify 

neutralizing efficacy prior to subsequent human clinical 
studies performed to evaluate the persistence of the 
altered antibody in the body despite the usual clearance 
mechanisms . 

15 VII. Therapeutic /Prophylactic Uses. 

This invention also relates to a method of treating 
humans experiencing RSV-related symptoms which comprises 
administering an effective dose of antibodies including 
one or more of the antibodies (altered, reshaped, 
2 0 monoclonal, etc.) described herein or fragments thereof. 

The therapeutic response induced by the use of the 
molecules of this invention is produced by binding to 
RSV and thus subsequently blocking RSV propagation. 
Thus, the molecules of the present invention, when in 

2 5 preparations and formulations appropriate for 

therapeutic use, are highly desirable for those persons 
experiencing RSV infection. For example, longer 
treatments may be desirable when treating seasonal 
episodes or the like. The dose and duration of 

3 0 treatment relates to the relative duration of the 

molecules of the present invention in the human 
circulation, and can be adjusted by one of skill in the 
art depending upon the condition being treated and the 
general health of the patient. 
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The altered antibodies, antibodies and fragments 

thereof of this invention may also be used alone or in 

conjunction with other antibodies, particularly human or 

humanized mAbs reactive with other epitopes on the F 

5 protein or other RSV target antigens as prophylactic 

agents . 

The mode of administration of the therapeutic and 
prophylactic agents of the invention may be any suitable 
route which delivers the agent to the host. The altered 

10 antibodies, antibodies, engineered antibodies, and 

fragments thereof, and pharmaceutical compositions of 
the invention are particularly useful for parenteral 
administration, i.e. , subcutaneously , intramuscularly, 
intravenously, or intranasally . 

15 Therapeutic and prophylactic agents of the 

invention may be prepared as pharmaceutical compositions 
containing an effective amount of the altered antibody 
of the invention as an active ingredient in a 
pharmaceutically acceptable carrier. An aqueous 

2 0 suspension or solution containing the antibody, 

preferably buffered at physiological pH, in a form ready 
for injection is preferred. The compositions for 
parenteral administration will commonly comprise a 
solution of the engineered antibody of the invention or 
25 a cocktail thereof dissolved in an pharmaceutically 

acceptable carrier, preferably an aqueous carrier. A 
variety of aqueous carriers may be employed, e.g., 0.4% 
saline, 0.3% glycine, and the like. These solutions are 
sterile and generally free of particulate matter. These 

3 0 solutions may be sterilized by conventional, well known 

sterilization techniques (e.g., filtration). The 
compositions may contain pharmaceutically acceptable 
auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and 
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buffering agents, etc. The concentration of the 
antibody of the invention in such pharmaceutical 
formulation can vary widely, i.e., from less than about 
0.5%, usually at or at least about 1% to as much as 15 
5 or 20% by weight and will be selected primarily based on 
fluid volumes, viscosities, etc., according to the 
particular mode of administration selected. 

Thus, a pharmaceutical composition of the invention 
for intramuscular injection could be prepared to contain 

10 1 mL sterile buffered water, and between about 1 ng to 
about 10 0 mg, e.g. about 5 0 ng to about 8 0 mg, or more 
preferably, about 5 mg to about 75 mg, of an engineered 
antibody of the invention. Similarly, a pharmaceutical 
composition of the invention for intravenous infusion 

15 could be made up to contain about 250 ml of sterile 
Ringer's solution, and about 1 to about 75 and 
preferably 5 to about 5 0 mg/ml of an engineered antibody 
of the invention. Actual methods for preparing 
parenterally administrable compositions are well known 

2 0 or will be apparent to those skilled in the art and are 

described in more detail in, for example, Remington's 
Pharmaceutical Science, 15th ed. , Mack Publishing 
Company, Easton, Pennsylvania. 

It is preferred that the therapeutic and 
25 prophylactic agents of the invention, when in a 

pharmaceutical preparation, be present in unit dose 
forms. The appropriate therapeutically effective dose 
can be determined readily by those of skill in the art. 
To effectively treat an inflammatory disorder in a human 

3 0 or other animal, one dose of approximately 0.1 mg to 

approximately 2 0 mg per 7 0 kg body weight of a protein 
or an antibody of this invention should be administered 
parenterally, preferably i.v. or i.m. (intramuscularly). 
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Such dose may, if necessary, be repeated at appropriate 

time intervals selected as appropriate by a physician. 

The altered antibodies and engineered antibodies of 
this invention may also be used in diagnostic regimens, 
5 such as for the determination of RSV mediated disorders 
or tracking progress of treatment of such disorders. As 
diagnostic reagents, these altered antibodies may be 
conventionally labeled for use in ELISAs and other 
conventional assay formats for the measurement of RSV 

10 levels in serum, plasma or other appropriate tissue, or 
the release by human cells in culture. The nature of 
the assay in which the altered antibodies are used are 
conventional and do not limit this disclosure. 

The antibodies, altered antibodies or fragments 

15 thereof described herein can be lyophilized for storage 
and reconstituted in a suitable carrier prior to use. 
This technique has been shown to be effective with 
conventional immunoglobulins and art-known 
lyophilization and reconstitution techniques can be 

2 0 employed. 

The following examples illustrate various aspects 
of this invention including the construction of 
exemplary engineered antibodies and expression thereof 
in suitable vectors and host cells, and are not to be 
25 construed as limiting the scope of this invention. All 
amino acids are identified by conventional three letter 
or single letter codes. All necessary restriction 
enzymes, plasmids , and other reagents and materials were 
obtained from commercial sources unless otherwise 

3 0 indicated. All general cloning ligation and other 

recombinant DNA methodology were as performed in T. 
Maniatis et al . , cited above, or Sambrook et al . , cited 
above . 
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Example 1 : Isolation of gX-1 scFv-1 

Single chain (sc) Fv libraries were prepared from 
an individual purposely exposed to RSV and selected 
against recombinant RSV F-protein following described 
5 procedures [R. H. Jackson et al, in Protein Engineering, 
A Practical Approach, A. R. Rees et al eds , Oxford 
University Press, chapter 12, pp. 277-301, 1992; H. R. 
Hoogenboom et al . , Nucl . Acid Res . , 19: 4133-4137 
(1991); J. D. Marks et al . , J. Mol , Biol . , 222: 581-597 
10 (1991)]. Briefly, lymphocytes were isolated from a 

blood sample taken 15 days post exposure. RNA isolated 
from the lymphocytes was used for preparation of scFv 
encoding repertoires for phage display. Sets of V- 
region primers were paired with constant region primers 

15 for heavy chain domain 1 IgG and IgM and light chain C-k 

and C-X and then linked in a scFv VH-VL orientation with 
a 15 amino acid spacer (glycine4-serine) 3 [SEQ ID NO: 21] 
by overlap PGR [see J. D. Marks et al . , cited above, for 
description of the primers] . 

20 The resulting four scFv repertoires (V-K with IgG 

and IgM, V-?i with IgG and IgM) were cloned into a 
phagemid vector similar to pHENl [H. R. Hoogenboom et 
al . , cited above] resulting in fusion of the scFvs to 
gene III of phage fd. The vector was then transformed 

25 into E. coll (e.g., strain TGI) by electroporation to 
yield the corresponding phagemid libraries. 

Phage libraries displaying the scFv-gene 3 fusions 
were prepared by infection of each of the plasmid 
libraries with the M13K07 helper phage [R. H. Jackson, 

3 0 cited above] and were individually subjected to 2 rounds 
of panning against recombinant F-protein coated onto 
plastic. In the first round, 10-^"^ phage in 2.5 ml 

phosphate buffered saline (PBS)/2% Marval™ non-fat dry 
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milk were incubated for 90 minutes in a tube coated with 

5 p,g/ml of F-protein [described in P. Tsui et ai, J . 
Immunol . , 157 : 772-780 (1996)] followed by 1 wash with 
lOx PBS/0.05% Tween 20 and a second wash with lOx PBS 
5 alone. Bound phage were eluted with 10 mM triethylamine 
and the eluate was neutralized with 1 M Tris-HCl, pH 
7.4. The eluted phage were amplified and subjected to a 
similar second round of panning, except that the 

concentration of F-protein for coating was 2 ^Lg/ml and 

10 the wash buffer contained 2 Ox PBS. 

E. coli were infected with the eluted phage and 96 
colonies from each starting library were superinf ec ted 
with helper phage and screened for F-protein binding 
activity. Only four positive clones were obtained from 

15 the 2 IgM libraries, whereas 41 positives were observed 
for the IgG libraries. By partial sequence analysis, 
all of the clones carried one of three different heavy 
chains . Complete sequences were obtained for the heavy 
and light chain V-regions for six clones, all from the 

20 IgG libraries. 

Serial dilutions of titered phage stocks of each of 
these six clones were tested by ELISA for binding to 
recombinant F-protein and to RSV infected cell lysate . 
All showed binding to F-protein with the phage 

2 5 designated gA,-1 showing the best activity. However, G^- 
1 and three other clones showed little binding to the 
RSV lysate. 

Three clones: G^-1, GX,-3 (lysate binding 

positive) , and Gk-1 (lysate binding negative) , where "K" 

30 and "X" designate the class of the light chain, were 

characterized further for competition of their binding 
by F-protein specific neutralizing monoclonal 
antibodies, and their ability to inhibit virus 
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infection. The neutralizing mAbs RSV19 and B4 described 
in International patent publication No. WO92/04381, 
published March 19, 1992, and International patent 
publication No. WO93/20210, published October 14, 1993, 

5 recognize distinct epitopes on the F-protein. GK-1 was 
strongly inhibited by both antibodies. GX-1 was 
significantly inhibited by B4 only. Gk-3 was not 
inhibited by either antibody (shown for gX-1 only; see 
Figs. lA and IB). In initial assays (Table I, 
10 experiments 1-3), all three clones showed neutralizing 

activity in vitro, with gX-1 being the most potent (Fig. 
2, a graph of experiment 2), while control wild-type 
phage (M13K07) not displaying scFv had no effect. 

To address the possibility that neutralization 

15 might result just from phage coating of virus, 

irrespective of epitope, a phage preparation of the non- 
neutralizing Fab 5-16 was tested in the same assay. In 
three out of four assays, this preparation also showed 
good neutralization activity, as did the control phage 

20 in two of these assays (Table I, experiments 4-7) . This 
confounding observation of variable neutralization by 
both Fab 5-16 and control M13K07 phage rendered the 
viral neutralization studies inconclusive. 



40 



wo 00/69462 



Table 1 



PCT/USOO/13694 



Phage 
Sample 


Virus Neutralization (IC50 x 10 

(aru or kru/ml)^ 




Experiment # 






z 










/ 


GK-1 a 


1, 600 




<300 










b 








<10 


<7 










80 


<300 










b 








8 . 1 


11 






c 














120 


Gk~3 a 




900 


<300 


180 








b 










<7 


10 




c 














730 


M13K07a 






>10'' 


>10^ 




>5 , 000 




b 










+all dil. 


+all dil. 


>10' 


Fab 5-19a 








>10^ 


40 


180 




b 














3 . 5 



Legend: 

10 Assay according to M. J. Cannon, J. Virol. Meth. , 

16:293-301. Virus at 100 infectious centers/well 
was incubated with dilutions of the indicated phage 
for 1 hr and then added to susceptible cells for 3 
hr. The virus /phage solution was aspirated and 
15 replaced with fresh medium and the cells were 

incubated overnight before peroxidase staining for 
virus infected cells. 

^ aru = ampicillin resistance units, a measure of 

20 phagmid containing particles. 

kru = kanamycin resistance units, a measure of 
particles containing the phage genome (for the 
M13K07 control only) . 

25 
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In the face of these results, made more ambiguous 

by the dependence of all assays on phage stocks verses 

antibody proteins of known concentration, gX-1 was 
selected as the most likely candidate for a potent 
5 neutralizing antibody based on (1) its apparent better 
binding to F-protein, (2) its selective inhibition of 
binding by the B4 antibody, and (3) its suggested 
activity over background in the virus neutralization 
assay . 

10 

Example 2 : Conversion of gX-1 scFV to mAb Version A 

The DNA and encoded protein sequences of the VH and 

VL regions of GX-I are shown in Figs. 3 [SEQ ID NOS : 1 
and 2] and 4 [SEQ ID NOS: 3 and 4], respectively. For 
15 expression in mammalian cells, the heavy chain variable 

region and the light chain variable region from the GX-1 
plasmid were cloned into derivatives of plasmid pCDN 
[Nambi, A. et al . , Mol . Cell . Biochem . , 131:75-86 
(1994)] in which the expression of the antibody chain is 

2 0 driven by the cytomegalovirus promoter (CMV) promoter. 
Plasmid pCD-HC68B is used for expressing full length 
heavy chains and plasmid pCN-HuLC, for expressing full 
length light chains. 

In the initial constructs, changes in the sequence 

2 5 at the amino terminus were introduced by the PCR primers 
used for cloning the light chain and heavy chain 

variable regions from plasmid gX-1 . In these 
constructs, the peptide signal sequence for both the 
heavy and light chains is derived from the Campath light 
30 chain [M. J. Page et al . , Biotechnology 9: 64-68 

(1991) ] . The heavy chain of gX~1 was PCR amplified from 

GX-1 phagemid DNA, using primers for the amino terminus 

and framework 4 of the variable reaion. The resultina 
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PCR fragment was cut with Xhol (site introduced by the 

amino terminus primer) and BstEII (naturally occurring 
site in framework 4), and cloned into an intermediate 
vector, F4HCV, at the XhoI/BstEII sites. 

5 This cloning grafted the variable region of gX-I 

onto the constant region of another anti-RSV heavy chain 
194-F4 [cloned at SmithKline Beecham from a human 
hybridoma] . This intermediate clone was cut with Xhol 
and Bspl2 0l, and introduced into the same sites in pCD- 

10 HC58B. The Xhol site is introduced at the amino 

terminus by the PCR primer and, when cloned into pCD- 
HC68B at the same site is preceded in frame by the 
Campath leader sequence. The Bspl20I site is a 
naturally occurring, highly conserved sequence at the 

15 beginning of the Ch-i domain, and when cloned into pCD- 
HC68B at the same site, is in frame with the remaining 
sequence for the Ch-i through Ch-3 regions of human IgGi . 

In the resulting construct, GX,-lApcd (Figs. 8A-8F [SEQ 
ID NO: 13]), the amino acids immediately following the 

2 0 Campath leader are EVQLLE [SEQ ID NO: 17], where the 

residues LE are encoded by the nucleotide sequence for 
the Xhol cloning site. 

The light chain of gX-1 was PCR amplified from the 

GJi-1 phagemid DNA, using primers for the amino terminus 
25 and frameworlc 4 of the variable region. The resulting 
PCR fragment was cut with Sad (site introduced by the 
amino terminus primer) and Avrll (naturally occurring 
site in frameworlc 4) , and cloned into 43-lpcn at the 
Sacl/Avrll sites. This cloning grafted the variable 

3 0 region of G^-1, in frame, onto the constant region of 

another anti-RSV lambda light chain 43 [P. Tsui et al . , 
J. Immunol . , 157: 772-780 (1996)], which had been cloned 
at SmithKline Beecham from a combinatorial library 
derived from RNA isolated from human spleen. The Sad 
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site is introduced at the amino terminus by the PGR 
primer and, when cloned into 43pcn at the same site, is 
preceded in frame by the Campath leader sequence. The 
first two amino acids of the mature light chain are 

5 therefore deleted. In the resulting construct, Gy\.-lApcn 
(Figs. 9A-9E [SEQ ID NO: 14]), the first two amino acids 
immediately following the leader are EL, where the 
residues EL are encoded by the nucleotide sequence for 
the Sad cloning site. 

10 The nucleotide sequences of the plasmids GX-lApcd 

and GX-lApcn are shown in Figs. 8A-8F [SEQ ID NO: 13] 
and 9A-9E [SEQ ID NO: 14] respectively. This set of 
vectors was used to produce antibody G?i-1A in COS cells 
and in CHO cells. 

15 

Example 3 : Cloning Of The Corrected GA.-1 Heavy and Light 
Chains 

In cloning the variable region of the GA.-1 heavy 
chain from the single chain Fv (scFv) format into the 
20 full length format, the fifth amino acid at the amino 
terminus was changed from Val to Leu, for cloning 
purposes. To correct this change, PCR primers were 

designed for the amino terminus of the gA,-1 heavy chain 
cloned into pCD, which reverted the fifth amino acid 

25 baclc to Val. The correction was introduced via the PCR 
overlap technique using the correction primers and 
primers annealing to sequences within the CMV promoter 
and the Ch-2 constant region as the outside 5' and 3' 
primers, respectfully. The final PCR product was 

30 digested with restriction enzymes, EcoRI and Bspl20l, 

and cloned into the G^-lApcd vector at the same sites to 
create G?l-lBpcd. 
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The final construct was sequenced to verify that 
the amino terminus of the heavy chain had been corrected 
from EVQLLE [SEQ ID NO : 17] to EVQLVE [ SEQ ID NO : 18] 
(see Fig 6) , The nucleotide sequence of coding region 

5 for the corrected heavy chain, gA,-1B, is shown in Figs. 
lOA-lOB [SEQ ID NO: 15] . 

In cloning the variable region of the gA.~1 light 
chain from the scFv format into the full length format, 
changes were introduced at the amino terminus for 
10 cloning purposes. Specifically, the first 2 amino acids 
(Gin and Ser) of the light chain were deleted and the 
third amino acid was changed from Val to Glu. To 
correct these changes, PGR primers were designed for the 

amino terminus of the gX-1 light chain cloned into pCN, 
15 which replaced the two deleted amino acids (Gin and Ser) 
and reverted the third amino acid baclc to Val. The 
corrections were introduced via the PGR overlap 
technique using the correction primers and primers 

annealing to sequences within the CMV promoter and the X 
20 constant region as the outside 5' and 3' primers, 

respectfully. The final PGR product was digested with 
restriction enzymes, EcoRI and Avrll and cloned into the 

GX-lApcn vector at the same sites to create G?i-lBpcn. 
The final construct was sequenced to verify that 

2 5 the amino terminus of the light chain had been corrected 

from --EL to QSVL (amino acids 1-4 of SEQ ID NO: 10) . 
The nucleotide sequence of coding region for the 

corrected light chain, gX-IB, is shown in Fig. 11 [SEQ 
ID NO: 16] . This vector GX.-lBpcn, was used with GX-- 

3 0 IBpcd to produce antibody GA.-1B, in GOS cells and in CHO 

cells . 
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Example 4 : Production of gX-I mABs in Mammalian Cells 

For initial characterization, the mAb constructs 
for each version, GX.-1A heavy and light chain, Gk-IB 
heavy and light chain, were expressed in COS cells 
5 essentially as described in Current Protocols in 

Molecular Biology, eds F. M. Ausubel et al . , 1988, John 
Wiley 8c Sons, vol. 1, section 9.1. On day 1 after the 
transf ection, the culture growth medium was replaced 
with a serum- free medium [SmithKline Beecham] which was 

10 changed on day 3 . Similar satisfactory results are 
obtained using a publicly available medium, DMEM 
supplemented with ITS™ Premix, an insulin, transferrin, 
selenium mixture (Collaborative Research, Bedford, MA) 
and 1 mg/ml bovine serum albumin (BSA) . 

15 The mAb was prepared from the day 3 + day 5 

conditioned medium by standard protein A affinity 
chromatography methods (e.g., as described in Protocols 
in Molecular Biology) using, for example, Prosep A 
affinity resin (Bioprocessing Ltd. , UK) . 

20 To produce larger quantities of the GA.-1B mAB (100- 

2 00 mgs), the vectors were introduced into a proprietary 
CHO cell system. However, similar results will be 
obtained using dhfr" CHO cells as previously described 
[P. Hensley et al . , J. Biol . Chem. , 269:23949-23958 

25 (1994)]. Briefly, a total of 30 [ig of linearized 

plasmid DNA (15 |LLg each of the A or B set of heavy chain 
and light chain vectors) is electroporated into 1x10 
cells. The cells are initially selected in nucleoside- 
free medium in 96 well plates. After three to four 
3 0 weeks, media from growth positive wells is screened for 
human immunoglobulin using an ELISA assay. The highest 
expressing colonies are expanded and selected in 
increasing concentrations of methotrexate for 
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amplification of the transfected vectors. The antibody 
is purified from conditioned medium by standard 
procedures using protein A affinity chromatography 
(Protein A sepharose, Pharmacia) followed by size 
5 exclusion chromatography (Superdex 200, Pharmacia). 

The concentration and the antigen binding activity 
of the eluted antibody are measured by ELISA. The 
antibody containing fractions are pooled and further 
purified by size exclusion chromatography. As expected 

10 for any such antibody, by SDS-PAGE, the predominant 

protein product migrated at approximately 15 0 kd under 
non-reducing conditions and as two bands of 5 0 and 25 kd 
under reducing conditions. For antibody produced in CHO 
cells, the purity was > 90%, as judged by SDS-PAGE, and 

15 the concentration was accurately determined by amino 
acid analysis. 



Example 5 : Binding of the GX-1 mABs to recombinant F 
protein 

2 0 Binding of the G^-1 mABs to recombinant F protein 

was measured in a standard solid phase ELISA. Antigen 
diluted in PBS pH 7.0 was adsorbed onto polystyrene 
round-bottom microplates (Dynatech, Immunolon II) for 18 
hours. Wells were then aspirated and blocked with 0.5% 

2 5 boiled casein (BC) in PBS containing 1% Tween 2 0 

(PBS/0.05% BC) for two hours. Antibodies (50 (ll/well) 
were diluted to varying concentrations in PBS/0.5% BC 
containing 0.025% Tween 20 and incubated in antigen 
coated wells for one hour. Plates were washed three 

3 0 times with PBS containing 0.05% Tween 20, using a 

Titertek 320 microplate washer, followed by addition of 

HRP-labelled protein A/G (50 |ll ) diluted 1:5000. After 
washing three times, TMBlue substrate (TSI, #TM102) was 
added and plates were incubated an additional 15 
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minutes. The reaction was stopped by addition of 1 N 
H2SO4 and absorbance read at 450 nm using a Biotek ELISA 
reader . 

The antigen binding epitope of the mABs was 

5 examined in a competition ELISA. The mABs were 

mixed with increasing concentrations of RSMU19 or B4 , 
two potent neutralizing mAbs [Tempest et al . , Biotech. , 
9: 266-271 (1991); Kennedy et ai . , J. Gen. Virol. , 69: 
3023-3032 (1988)] and added to F protein-coated wells. 
10 The epitope regions recognized by mAbs RSMU19 and B4 are 
quite distinct from each other as previously described 
in Arbiza et al . , J. Gen. Virol . , 73: 2225-2234 (1992). 

The concentration of the gX-1 mABs used in competition 
studies was determined previously to give 9 0% maximal 
15 binding to F antigen. Binding of the gX-1 mABs in the 
presence of other mABs was detected using HRP-labelled 
goat anti-human IgG. The reaction was developed as 
stated above. 

The gA,~1 mABs demonstrated potent binding to 
20 recombinant F (rF) protein by ELISA (EC50 for mAB B = 2.6 
ng/ml) . Binding of the GA,-1 mABs to rF protein was 
inhibited by mAb B4 , for which the F protein amino acids 
critical for antigen recognition are amino acids 2 68, 

272 and 27 5 of SEQ ID NO : 20) . Binding of the GX-1 mABs 
25 to rF protein was not inhibited by mAb RSMU19, for which 
F protein amino acid 429 of SEQ ID NO: 20 is critical 
for antigen recognition. These results indicate that 
residues in the region of amino acids 255-275 of the F 

protein [SEQ ID NO: 20] are critical for gX-1 mAB 
3 0 recognition. 
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Example 6 ; In vitro Fusion-Inhibition Activity of the 

GA.-1 luABs 

The ability of the gX-1 mABs to inhibit virus- 
induced cell fusion was determined using a modification 
5 of the in vitro microneutralization assay [Beeler et 

ai., J. Virol . , 63:2941-2950 (1989)]. In this assay, 50 

|il of RS Long strain virus (10-100 TCIDso/well [American 
Type Culture Collection ATCC VR-2 6] were mixed with 0.1 
ml VERO cells (SXlOVwell) [ATCC CCL-81] in Minimum 
10 Essential Media (MEM) containing 2% fetal calf serum 

(PCS), for 4 hours at 37°C, 5% CO2 . Serial two-fold 

dilutions (in quadruplicate) of mAB (50 \xl) were then 
added to wells containing virus-infected cells. Control 
cultures contained cells incubated with virus only 
15 (positive virus control) or cells incubated with media 
alone . 

Cultures were incubated at 3 7*^C in 5% CO2 for 6 days 
at which time cytopathic effects (CPE) in virus control 
wells were > 90%. Microscopic examination for 
2 0 cytopathic effects were confirmed by ELISA. Media was 

aspirated from cultures and replaced with 50 )ll of 90% 
methanol containing 0.6% H2O2 . After 10 minutes, 
fixative was aspirated and plates were air dried 
overnight. Viral antigen was detected in the fixed 

25 cultures using 1 |ig/ml biotinylated RSCHB4 (a human Fc 
derivative of the bovine B4 mAb [SmithKline Beecham] ) , 
followed by HRP-labelled streptavidin (Boehringer- 
Mannheim) diluted 1:10,000. The reaction was developed 
using TMBlue and stopped by addition of IN H2SO4. 

30 Absorbance was measured at 450 nm (O.D.450). 

Fusion-inhibition titers were defined as the 
concentration of antibody which caused a 50% reduction 
in ELISA signal (ED50) as compared to virus controls. 

49 
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Based on the curve generated in the ELISA by the 

standard virus titration, a 50% reduction in O.D.450 
corresponded to > 90% reduction in virus titer. 
Calculation of the 50% point was based on regression 
5 analysis of the dose titration. 

The gX-1 inABs demonstrated potent In vitro fusion- 
inhibition activity against type A RS Long strain virus 
(ED50 for iTiAB B of 0.51 + 0.3 8 |ig/ml) . In this in vitro 
fusion-inhibition assay, gX-1 mAB B was more active than 

10 the humanized mAB RSHZ19 (ED50 of 0.4-3.0 |Llg/ml) [Wyde et 
al . , Pediatr . Res . , 3 8 { 4 ) : 543 -550 ] in comparative 
assays . 

Example 7 : In vivo Activity of G>l-1 mAB B: Prophylaxis 
15 and Therapy in Balb/c Mouse Model 

Balb/c mice (5/group) were inoculated 
intraperitoneally with doses ranging from 0.06 mg/kg to 

5 mg/kg of gX-1 mAB B either 24 hours prior 
(prophylaxis) or 4 days after (therapy) intranasal 
20 infection with 10^ PFU of the A2 strain of human RSV. 

Mice were sacrificed 5 days after infection. Lungs were 
harvested and homogenized to determine virus titers. 

Virus was undetectable in the lungs of mice treated 

prophylactically with > 1.25 mg/kg GA,-1 mAB B either 
25 prophylactically or therapeutically. See Table II 

below. Significant viral clearance {2-3 logio) was also 

achieved in animals receiving 0.31 mg/kg GA.-1 mAB B 
either prophylactically or therapeutically. 

30 
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Therapy In Balb/c 



Mice 



Treatment 



Dose 
(mg/kg) 



Lung Virus Titer { logic /g lung) 
Prophylaxis Therapy 



GA.-1 mAB B 



5 

1.25 

0 .31 
0 . 06 



<1.7 

<1 . 7 
1.8 + 0.3 
4.3 + 0.7 



<1 . 7 
<1.7 
2.9 + 0.4 
4.5 + 0.3 



10 



PBS 



4.8 + 0.7 



4.7 + 0.2 



The GX-1 mABs have potent antiviral activity 



in vitro against a broad range of native RSV isolates of 
15 both type A and B, and show prophylactic and therapeutic 

efficacy in vivo in animal models. Thus, the gX-1 mABs 
are candidates for therapeutic, prophylactic, and 
diagnostic application in man. 



2 0 present invention may be made by one of skill in the art 
in view of the invention described herein. Such 
modifications are believed to be encompassed by the 
specification and claims of the present invention. All 
references cited above are incorporated by reference 

2 5 herein. 



Numerous modifications and variations of the 
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1 . A human monoclonal antibody and 
functional fragments thereof, specifically reactive with 
an F protein epitope of Respiratory Syncytial Virus and 
capable of neutralizing infection by said virus selected 

from the group consisting of GX-IA and Gy*l-1B. 

2 . The monoclonal antibody according to 
Claim 1 which comprises the light chain amino acid 
sequence of Fig. 3 SEQ ID NO : 2 and the heavy chain 
amino acid sequence of Fig. 4 SEQ ID NO: 4. 

3 . The monoclonal antibody according to 
Claim 1 which comprises the light chain amino acid 
sequence encoded by the DNA sequence of Fig. 11 SEQ ID 
NO: 16 and the heavy chain amino acid sequence encoded 
by the DNA sequence of Figs. lOA-lOB SEQ ID NO: 15. 

4. The monoclonal antibody according to 
Claim 1 wherein said fragment is selected from the group 
consisting of Fv, Fab and F(ab')2- 

5. An isolated nucleic acid molecule 
selected from the group consisting of: 

(a) a nucleic acid sequence encoding any 
of the human monoclonal antibodies, altered antibodies 
and CDRs of any of the claims 1-4; 

(b) a nucleic acid complementary to any 
of the sequences in (a) ; and 

(c) a nucleic acid sequence of 18 or more 
nucleotides capable of hybridizing to the CDRs of any of 
claims 1-4 under stringent conditions. 
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6. The isolated nucleic acid molecule 
according to Claim 5 comprising the sequences of Figs. 
8A-8F and 9A-9E SEQ ID NOS : 13 and 14, or Figs. lOA-lOB 
and 11 SEQ ID NOS: 15 and 16. 

7 . A recombinant plasmid comprising the 
nucleic acid sequences of any of Claims 5 or 6 . 

8. A host cell comprising the plasmid of 

Claim 7 . 

9. A process for the production of a human 
antibody specific for RSV comprising culturing the host 
cell of Claim 8 in a medium under suitable conditions of 
time temperature and pH and recovering the antibody so 
produced. 

10 . A method of detecting RSV comprising 
contacting a source suspected of containing RSV with a 
diagnostically effective amount of the monoclonal 
antibody of Claim 1 and determining whether the 
monoclonal antibody binds to the source. 

11. A method for providing passive 
immunotherapy to RSV disease in a human, comprising 
administering to the human an immuno therapeutically 
effective amount of the monoclonal antibody of Claim 1. 

12 . The method according to Claim 11 wherein 
the passive immunotherapy is provided prophylactically . 

13 . A pharmaceutical composition comprising 
at least one dose of an immunotherapeutically effective 
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amount of the monoclonal antibody of Claim 1 in a 
pharmaceutically acceptable carrier . 



14. A pharmaceutical composition comprising 
at least one dose of an immunotherapeutically effective 
amount of the monoclonal antibody of Claim 1 in 
combination with at least one additional monoclonal 
antibody . 

15. The pharmaceutical composition according 
to Claim 14 wherein said additional monoclonal antibody 
is an anti-RSV antibody distinguished from the antibody 
of Claim 1 by virtue of being reactive with a different 
epitope of the RSV F protein antigen. 
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Fig. lA 

RSV19/GI1 scFv phage competition 
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Fig. 2 



Neutralisation of RSA//273 witli phage Fv 




a.r.u. (xlO^) 



G lambda 3 G lambda 1 G kappa 1 
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FIGURE 3 

1 CAGTCTGTGTTGACGCAGCCGCCCTCAGTCTCTGCGGCCCCAGGACAGAA 5 0 

QSVLTQPPSVSAAPGQK 



5 1 GGTCACCATCTCCTGCACTGGGAGCAGCTCCAACCTCGGGGCAGGTTATG 10 0 
VTISCTGSSSNLGAGYD 



101 ATGTTCACTGGTACCGGCAACTTCCAGGGACAGCCCCCAAACTCCTCATC 15 0 
VHWYRQLPGTAPKLLI 



151 TATGATAACAACAATCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTC 2 0 0 
YDNNNRPSGVPDRFSGS 



2 01 CAAGTCTGGCCCCTCAGCCTCCCTGGCCATCTCTGGGCTCCAGGCTGAGG 2 5 0 
KSGPSASLAISGLQAED 



251 ATGAGGCTGATTATTACTGCCAGTCCTATGACAGCAGCCTGAATGGTTAT 3 00 
EADYYCQSYDSSLNGY 



3 01 GTCTTCGGAACTGGGACCCAGCTCACCGTCCTAGGT 
VFGTGTQLTVLG 
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FIGURE 4 



1 GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTACAGCCTGGGGGGTC 5 0 

EVQLVESGGGLVQPGGS 



5 1 CCTGAGACTCTCCTGCGCAGCCTCTGGAGTCTCCCTCAGTGGATACAAGA 10 0 
LRLSCAASGVSLSGYKM 



101 TGAACTGGGTCCGCCAGGCTCCAGGGAAGGGGCTGGAATGGGTCTCTTCC 15 0 
NWVRQAPGKGLEWVSS 



151 ATTACTGGTATGAGTAATTACATACACTACTCAGACTCAGTGAAGGGCCG 2 0 0 
ITGMSNYIHYSDSVKGR 



2 01 ATTCACCATCTCCAGAGACAACGCCATGAACTCACTGTATCTGCAAATGA 2 5 0 
FTI SRDNAMNSLYLQMN 



2 51 ACAGCCTGACAGCCGAGGAC ACGGGTGTTTATTATTGTGCGACACAACCG 3 0 0 
SLTAEDTGVYYCATQP 



3 01 GGGGAGCTGGCGCCTTTTGACCATTGGGGCCAGGGAACCCTGGTCACCGT 3 50 
GELAPFDHWGQGTLVTV 



3 51 CTCCTCA 
S S 



357 
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Figure 5 
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FIGURE 6 

Comparison of the Heavy Chain Amino Acid Seqiiences of the 
GX-I single chain fv and mAbs 

Leader and Variable Regions 



GL Dp58: EVQLVESGGGLVQPGGSLRLSCAASGFTFS 

G>.-1 scFv: VSL- 

GX- 1A : MGWSCI ILFLVATATGVHS L 

GX-IB: V 

CDRl CDR2 

GL Dp58: SYEMNWVRQAPGKGLEWVSYISSSGSTIYYADSVKGRFTISRDNAKNSLY 

G>.-1 scFV: G-K S-TGMSNY-H-S M 

GA,-1A: 

GA--1B: 

CDR3 

GL: Dp58: LQMNSLRAEDTAVYYCAR 

GX-1 scFv: T G TQPGEIiAPFDHWGQGTLVTVSS 

G?i-1A: 

GX-IB: 
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FIG 7 

Comparison of the Light Chain Amino Acid Sequences of the G^-IA: 
single chain Fv and mAbs 

Leader and Variable Regions 

CDRl 



GL DpL8: QSVLTQPPSVSGAPGQRVTISCTGSSSNIG 

G?i-1 scFv: A K L- 

G?l-1A: MGfWSCIILFLVATATGVHS E 

GX-IB: QSV 

CDR2 

GL DpL8: AGYDVHWYQQLPGTAPKLLIYGNSNRPSGVPDRFSGSKSGTSASLAITGL 

G>L-lscFv: R D-N P S — 

g;^-ia: 

G?c-1B: 

CDR3 

GL DpL8: QAEDEADYYC 

QX-\ scFv: QSYDSSLNGYVFGTGTQLTVLG 

G>t-1A: 

G?L-1B: 
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FIGURE 8 A 

1 gacgtcgcggccgctctaggcc tccaaaaaagcc tec tcac tact tctgg 

51 aatagctcagaggccgaggcggcc tcggcctc tgcataaataaaaaaaat 

101 tagtcagccatgcatggggcggagaatgggcggaactgggcggagttagg 

151 ggcgggatgggcggagttaggggcgggactatggttgctgactaattgag 

2 01 atgcatgctttgcatacttctgcctgctggggagcctggggactttccac 
251 acctggttgctgactaattgagatgcatgctttgcatacttctgcctgct 

3 01 ggggagcctggggactttccacaccctaactgacacacattccacagaat 
3 51 taattcccggggatcgatccgtcgacgtacgactagttattaatagtaat 
401 caattacggggtcattagttcatagcccatatatggagttccgcgttaca 
451 taacttacggtaaatggcccgcctggctgaccgcccaacgacccccgccc 
5 01 attgacgtcaataatgacgtatgttcccatagtaacgccaatagggactt 
551 tccattgacgtcaatgggtggactatttacggtaaactgcccacttggca 
601 gtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatga 
651 cggtaaatggcccgcctggcattatgcccagtacatgaccttatgggact 
7 01 t t cc tacttggcagtacatctacg tat tag tcatcgctattaccatggtg 
751 atgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacg 
801 gggatttccaagtctccaccccattgacgtcaatgggagtttgttttggc 
851 accaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattg 
9 01 acgcaaatgggcggtaggcgtgtacggtgggaggtc tatataagcagagc 

EcoRI 

951 tgggtacgtgaaccgtcagatcgcctggagacgccatcgaa^ttctgagca 

10 01 cacaggacctcacc atg ggatggagctgtatcatcctcttcttggtagca 

MGWSCIILFLVA 
Leader start 

Xhol 

1051 acagctacaggtgtccactccgaggtccaactgc_tcgagtctgggggagg 

T A T G V H S E V Q L L E S 

Processed N-term 
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FIGURE 8B 

1101 cttggtacagcctggggggtccctgagactctcctgcgcagcctctggag 

1151 tctccctcagtggatacaagatgaactgggtccgccaggctccagggaag 

12 01 gggctggaatgggtctcttccattactggtatgagtaattacatacacta 
1251 c tcagactcagtgaagggccgattcaccatctccagagacaacgccatga 

13 01 actcactgtatctgcaaatgaacagcctgacagccgaggacacgggtgtt 

13 51 tattattgtgcgacacaaccgggggagctggcgccttttgaccattgggg 

BstEII Bspl20l 

14 01 ccagggaaccct ggtcacc gtctcctcagcctccaccaa gggccc atcgg 

QGTLVTV S S/ 

framework IV / CHI 

1451 tcttccccctggcaccctcctccaagagcacctctgggggcacagcggcc 

1501 ctgggctgcctggtcaaggactacttccccgaaccggtgacggtgtcgtg 

1551 gaactcaggcgccctgaccagcggcgtgcacaccttcccggc tgtcc tac 

BstEII 

1601 agtcctcaggactctactccctcagcagcgtggtgaccgtgccctccagc 

1651 agcttgggcacccagacctacatctgcaacgtgaatcacaagcccagcaa 

17 01 caccaaggtggacaagaaagttgagcccaaatcttgtgacaaaactcaca 
1751 catgcccaccgtgcccagcacctgaactcctggggggaccgtcagtcttc 

18 01 ctcttccccccaaaacccaaggacaccctcatgatctcccggacccctga 
1851 ggtcacatgcgtggtggtggacgtgagccacgaagaccctgaggtcaagt 

19 01 tcaactggtacgtggacggcgtggaggtgcataatgccaagacaaagccg 
1951 cgggaggagcagtacaacagcacgtaccgggtggtcagcgtcctcaccgt 
2 001 cctgcaccaggactggctgaatggcaaggagtacaagtgcaaggtctcca 
2 051 acaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggg 
2101 cagccccgagaaccacaggtgtacaccctgcccccatcccgggatgagct 
2151 gaccaagaaccaggtcagcc tgacctgcctggtcaaaggcttc tatccca 
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FIGURE 8C 

22 01 gcgacatcgccgtggagtgggagagcaatgggcagccggagaacaactac 

22 51 aagaccacgcc tcccgtgctggactccgacggctccttcttcctctacag 
2 3 01 caagc tcaccgtggacaagagcaggtggcagcaggggaacgtcttctcat 

23 51 gc tccgtgatgcatgaggc tctgcacaaccactacacgcagaagagcctc 

2 4 01 tccctgtctccgggtaaatgatagatatctacgtatgatcagcctcgac t 

S P G K * C-term of heavy chain 

2451 gtgccttctagttgccagccatctgttgtttgcccctcccccgtgccttc 

25 01 cttgaccc tggaaggtgccactcccactgtcctttcctaataaaatgagg 

2 551 aaattgcatcgcattgtctgagtaggtgtcattctattctggggggtggg 

2 6 01 gtggggcaggacagcaagggggaggattgggaagacaatagcaggcatgc 

2 651 tggggatgcggtgggctctatggaaccagctggggctcgacagcgctgga 

27 01 tctcccgatccccagctttgcttctcaatttcttatttgcataatgagaa 

2751 aaaaaggaaaattaattttaacaccaattcagtagttgattgagcaaatg 

2 801 cgttgccaaaaaggatgctttagagacagtgttctctgcacagataagga 

2 851 caaacattattcagagggagtacccagagctgagactcctaagccagtga 

2 901 gtggcacagcattc tagggagaaatatgcttgtcatcaccgaagcctgat 

2 951 tccgtagagccacaccttggtaagggccaatctgctcacacaggatagag 

3 001 agggcaggagccagggcagagcatataaggtgaggtaggatcagttgctc 
3 051 ctcacatttgcttctgacatagttgtgttgggagcttggatagcttggac 
3101 agctcagggctgcgatttcgcgccaaacttgacggcaatcctagcgtgaa 
3151 ggctggtaggattttatccccgctgccatcatggttcgaccattgaactg 
32 01 catcgtcgccgtgtcccaaaatatggggattggcaagaacggagacctac 

32 51 cctggcctccgc tcaggaacgagttcaagtacttccaaagaatgaccaca 
3 3 01 acctcttcagtggaaggtaaacagaatctggtgattatgggtaggaaaac 

33 51 ctggttctccattcctgagaagaatcgacctttaaaggacagaattaata 
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FIGURE 8D 

3 401 tagttc tcagtagagaactcaaagaaccaccacgaggagc tcattttctt 

3 451 gccaaaagtttggatgatgccttaagacttattgaacaaccggaattggc 

3 501 aagtaaagtagacatggtttggatagtcggaggcagttc tgtttaccagg 

3 551 aagccacgaatcaaccaggccaccttagac tc tttgtgacaaggatcatg 

3 6 01 caggaatttgaaagtgacacgttt ttcccagaaattgatttggggaaata 

3 651 taaacttctcccagaatacccaggcgtcctctc tgaggtccaggaggaaa 

3 7 01 aaggcatcaagtataagtttgaagtctacgagaagaaagactaacaggaa 

3 751 gatgctttcaagttc tctgctcccctcc taaagctatgcatttttataag 

3 801 accatgggacttttgctggctttagatcagcctcgactgtgccttctagt 

3 851 tgc cage cat ctgttgtttgcccctcccccgtgccttccttgaccctgga 

3 901 aggtgccactcccac tgtcctttcctaataaaatgaggaaattgcatcgc 

3 951 attgtctgagtaggtgtcattctattctggggggtggggtggggcaggac 
40 01 agcaagggggaggattgggaagacaatagcaggcatgctggggatgcggt 

4 051 gggctctatggaaccagctggggctcgatcgagtgtatgactgcggccgc 
4101 gatcccgtcgagagcttggcgtaatcatggtcatagctgtttcctgtgtg 
4151 aaattgttatccgctcacaattccacacaacatacgagccggaagcataa 

42 01 agtgtaaagcctggggtgcctaatgagtgagctaactcacattaattgcg 
4251 ttgcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctgca 

43 01 ttaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgct 
43 51 cttccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcgg 
4401 cgagcggtatcagc tcac tcaaaggcggtaatacggttatccacagaatc 
4451 aggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggcca 
4501 ggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccc 
4551 cctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaaccc 
4601 gacaggactataaagataccaggcgtttccccctggaagctccctcgtgc 
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FIGURE 8E 

4 651 gctc tcctgttccgaccc tgccgc ttaccggatacctgtccgcctttctc 

47 01 ccttcgggaagcgtggcgctttc tcaatgc tcacgc tgtaggtatctcag 
4751 ttcggtgtaggtcgttcgctccaagc tgggctgtgtgcacgaaccccccg 

48 01 ttcagcccgaccgctgcgccttatccggtaactatcgtc ttgagtccaac 
4851 ccggtaagacacgacttatcgccac tggcagcagccac tggtaacaggat 
4901 tagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggc 
4951 ctaactacggctacactagaaggacagtatttggtatctgcgc tctgctg 

5 0 01 aagccagttaccttcggaaaaagagttggtagctcttgatccggcaaaca 
5051 aaccaccgctggtagcggtggtttt tttgtttgcaagcagcagattacgc 
5101 gcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtct 
5151 gacgctcagtggaacgaaaactcacgttaagggattttggtcatgagatt 

52 01 atcaaaaaggatcttcacctagatccttttaaattaaaaatgaagtttta 
5251 aatcaatctaaagtatatatgagtaaacttggtctgacagttaccaatgc 

53 01 ttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccat 
53 51 agttgcctgactccccgtcgtgtagataactacgatacgggagggcttac 
5401 catctggccccagtgctgcaatgataccgcgagacccacgctcaccggct 
5451 ccagatttatcagcaataaaccagccagccggaagggccgagcgcagaag 
5501 tggtcctgcaactttatccgcctccatccagtctattaattgttgccggg 
5551 aagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgcc 
5601 attgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcatt 
5 651 cage tc egg ttcccaacgatcaaggcgagttacatgatcccccatgttgt 
5701 gcaaaaaagcggttagctccttcggtcctccgatcgttgtcagaagtaag 
5751 ttggccgcagtgttatcaetcatggttatggcagcactgcataattctc t 
5801 taetgtcatgccatccgtaagatgcttttetgtgaetggtgagtactcaa 
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FIGURE 8F 

5851 ccaagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccg 

5901 gcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgct 

5951 catcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgc 

60 01 tgttgagatccagttcgatgtaacccactcgtgcacccaactgatcttca 

6 051 gcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggca 

6101 aaatgccgcaaaaaagggaataagggcgacacggaaatgttgaatactca 

6151 tactcttcctttttcaatattattgaagcatttatcagggttattgtctc 

62 01 atgagcggatacatatttgaatgtatttagaaaaataaacaaataggggt 

62 51 tccgcgcacatttccccgaaaagt-gccacc t 
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FIGURE 9A 

1 gacgtcgcggccgctctaggcctccaaaaaagcctcctcaccacttctgg 

51 aatagctcagaggccgaggcggcctcggcctctgcataaataaaaaaaat 

101 tagtcagccatgcatggggcggagaatgggcggaactgggcggagttagg 

151 ggcgggatgggcggagttaggggcgggactatggttgctgactaattgag 

2 01 atgcatgctttgcatacttctgcctgctggggagcctggggactttccac 

2 51 acctggttgctgac taattgagatgcatgctttgcatacttctgcctgct 

3 01 ggggagcctggggactttccacaccctaactgacacacattccacagaat 
3 51 taattcccggggatcgatccgtcgacgtacgactagttattaatagtaat 
401 caattacggggtcattagttcatagcccatatatggagttccgcgttaca 
451 taacttacggtaaatggcccgcctggctgaccgcccaacgacccccgccc 
501 attgacgtcaataatgacgtatgttcccatagtaacgccaatagggactt 
551 tccattgacgtcaatgggtggactatttacggtaaactgcccacttggca 
601 gtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatga 
651 cggtaaatggcccgcctggcattatgcccagtacatgaccttatgggact 
7 01 ttcctacttggcagtacatctacgtattagtcatcgctattaccatggtg 
751 atgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacg 
801 gggatttccaagtctccaccccattgacgtcaatgggagtttgttttggc 
851 accaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattg 
9 01 acgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagc 

EcoRI 

951 tgggtacgtgaaccgtcagatcgcctggagacgccatcgaatjtctgagca 

1001 cacaggacctcacc atg ggatggagctgtatcatcctcttcttggtagca 

MGWSCIIL.FLVA 
Leader start 

Sad 

1051 acagctacaggtgtccactcc gagctc acgcagccgccctcagtctctgc 
T A T G V H S E L T Q 

Processed N-term 
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FIGURE 9B 

1101 ggccccaggacagaaggtcaccatctcctgcactgggagcagctccaacc 
1151 tcggggcaggttatgatgttcactggtaccggcaacttccagggacagcc 
12 01 cccaaactcctcatctatgataacaacaatcggccctcaggggtccctga 

12 51 ccgattctctggc tccaagtctggcccctcagcctccctggccatctctg 

13 01 ggctccaggctgaggatgaggctgattattactgccagtcc tatgacagc 

Avrll 

13 51 agcctgaatggttatgtcttcggaactgggacccagctcaccgtcctagg 

T Q L T V L G 

Framework IV / CX 

1401 tcagcccaaggctgccccctcggtcactctgttcccgccctcctctgagg 

1451 agcttcaagccaacaaggccacactggtgtgtc tcataagtgacttctac 

15 01 ccgggagccgtgacagtggcctggaaggcaattagcagccccgtcaaggc 
1551 gggagtggagaccaccacaccctccaaacaaagcaacaacaagtacgcgg 

16 01 ccagcagctatctgagcctgacgcctgagcagtggaagtcccacagaagg 
1651 tacagctgccaggtcacgcatgaagggagcaccgtggagaagacagtggc 

17 01 ccctacagaatgttcatagttctagatctacgtatgatcagcctcgactg 

P T E C S * C-term light chain 

17 51 tgccttctagttgccagccatctgttgtttgcccctcccccgtgccttcc 

1801 ttgaccctggaaggtgccactcccactgtcctttcctaataaaatgagga 

1851 aattgcatcgcattgtctgagtaggtgtcattctattctggggggtgggg 

19 01 tggggcaggacagcaagggggaggattgggaagacaatagcaggcatgct 

1951 ggggatgcggtgggctctatggaaccagctggggctcgacagctcgagct 

2 001 agctttgcttctcaatttcttatttgcataatgagaaaaaaaggaaaatt 

2 051 aattttaacaccaattcagtagttgattgagcaaatgcgttgccaaaaag 

2101 gatgctttagagacagtgttctctgcacagataaggacaaacattattca 

2151 gagggagtacccagagctgagactcctaagccagtgagtggcacagcatt 
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FIGURE 9C 

22 01 ctagggagaaatatgcttgtcatcaccgaagcctgattccgtagagccac 

22 51 accttggtaagggccaatctgctcacacaggatagagagggcaggagcca 

23 01 gggcagagcatataaggtgaggtaggatcagt tgc tec tcacatttgctt 

23 51 ctgacatagttgtgttgggagc ttggatcgatccaccatggttgaacaag 

24 01 atggattgcacgcaggttctccggccgcttgggtggagaggctattcggc 
2451 tatgactgggcacaacagacaatcggctgc tctgatgccgccgtgttccg 
2501 gctgtcagcgcaggggcgcccggttctttttgtcaagaccgacctgtccg 
2 551 gtgccc tgaatgaactgcaggacgaggcagcgcggc tatcgtggctggcc 
2 601 acgacgggcgttccttgcgcagctgtgctcgacgttgtcac tgaagcggg 
2 651 aagggactggctgctattgggcgaagtgccggggcaggatctcctgtcat 
2701 ctcaccttgctcctgccgagaaagtatccatcatggctgatgcaatgcgg 

2 7 51 cggctgcatacgcttgatccggctacctgcccattcgaccaccaagcgaa 
2801 acatcgcatcgagcgagcacgtactcggatggaagccggtcttgtcgatc 
2851 aggatgatctggacgaagagcatcaggggctcgcgccagccgaactgttc 
29 01 gccaggctcaaggcgcgcatgcccgacggcgaggatc tcgtcgtgaccca 
2951 tggcgatgcctgcttgccgaatatcatggtggaaaatggccgcttttctg 

3 001 gattcatcgactgtggccggctgggtgtggcggaccgctatcaggacata 
3 051 gcgttggctacccgtgatattgctgaagagcttggcggcgaatgggctga 
3101 ccgcttcctcgtgctttacggtatcgccgctcccgattcgcagcgcatcg 
3151 ccttctatcgccttcttgacgagttcttctgagcgggactctggggttcg 
32 01 aaatgaccgaccaagcgacgcccaacctgccatcacgagatttcgattcc 

32 51 accgccgccttctatgaaaggttgggcttcggaatcgttttccgggacgc 

33 01 cggctggatgatcctccagcgcggggatctcatgctggagttcttcgccc 
33 51 accccaacttgtttattgcagcttataatggttacaaataaagcaatagc 
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FIGURE 9D 

34 01 atcacaaatttcacaaataaagcatttttttcactgcattctagttgtgg 

3 451 tttgtccaaactcatcaatgtatcttatcatgtctggatcgcggccgcga 

3 5 01 tcccgtcgagagcttggcgtaatcatggtcatagc tgtttcctgtgtgaa 

3 551 attgttatccgctcacaattccacacaacatacgagccggaagcataaag 

3 501 tgtaaagcctggggtgcc taatgagtgagctaactcacattaattgcgtt 

3 651 gcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctgcatt 

3 7 01 aatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctct 

3 751 tccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcg 

3 8 01 agcggtatcagctcactcaaaggcggtaatacggttatccacagaatcag 

3851 gggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccagg 

39 01 aaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccc 
3 951 tgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccga 

40 01 caggactataaagataccaggcgtttccccc tggaagctccc tcgtgcgc 
4051 tctcctgttccgaccctgccgcttaccggatacctgtccgcctttctccc 
4101 ttcgggaagcgtggcgctttctcaatgctcacgctgtaggtatctcagtt 
4151 cggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgtt 

42 01 cagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaaccc 
4251 ggtaagacacgacttatcgccactggcagcagccactggtaacaggatta 

43 01 gcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcct 
4351 aactacggctacactagaaggacagtatttggtatctgcgctctgctgaa 
4401 gccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaa 
4451 ccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgc 
4501 agaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctga 
4551 cgctcagtggaacgaaaactcacgttaagggattttggtcatgagattat 
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FIGURE 9E 

46 01 caaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaa 
4651 tcaatctaaagtatatatgagtaaacttggtctgacagttaccaacgctt 

47 01 aatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatag 
4751 ttgcctgactccccgtcgtgtagataactacgatacgggagggcttacca 
4801 tctggccccagtgctgcaatgataccgcgagacccacgctcaccggctcc 
4851 agatttatcagcaataaaccagccagccggaagggccgagcgcagaagtg 

49 01 gtcctgcaactttatccgcctccatccagtctattaattgttgccgggaa 
4951 gctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgccat 

50 01 tgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcattca 
5 051 gctccggttcccaacgatcaaggcgagttacatgatcccccatgttgtgc 
5101 aaaaaagcggttagctccttcggtcctccgatcgttgtcagaagtaagtt 
5151 ggccgcagtgttatcactcatggttatggcagcactgcataattctctta 
52 01 ctgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaacc 

52 51 aagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccggc 

53 01 gtcaatacgggataataccgcgccacatagcagaactttaaaagtgctca 
53 51 tcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctg 
5401 ttgagatccagttcgatgtaacccactcgtgcacccaactgatcttcagc 
5451 atcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaa 
5501 atgccgcaaaaaagggaataagggcgacacggaaatgttgaatactcata 
5551 c tc t tec tttttcaa tat tattgaagcatttatcagggt tat tgtc teat 
5 601 gagcggatacatatttgaatgtatttagaaaaataaacaaataggggttc 
5651 cgcgcacatttccccgaaaagtgccacct 
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FIGURE lOA 

EcoRI 

gaattc tgagca 1000 

cacaggacct caec a tg ggatggagctgtatcatcctcttcttggtagca 105 0 

MGWSCIILFLVA 

acagctacaggtgtccactccgaggtgcagctggtggagtctgggggagg 110 0 

T A T G V H S E V Q L V E S - 

N- term 

cttggtacagcctggggggtccctgagactctcctgcgcagcctctggag 1150 

tctccctcagtggatacaagatgaactgggtccgccaggctccagggaag 12 0 0 

gggctggaatgggtctcttccattac tggtatgagtaattacatacacta 12 5 0 

ctcagactcagtgaagggccgattcaccatctccagagacaacgccatga 13 0 0 

actcactgtatctgcaaatgaacagcctgacagccgaggacacgggtgtt 13 50 

tattattgtgcgacacaaccgggggagctggcgccttttgaccattgggg 140 0 

Bspl20l 

ccagggaaccctggtcaccgtctcctcagcctccaccaagggc£catcgg 145 0 

tcttccccctggcaccctcctccaagagcacctctgggggcacagcggcc 15 0 0 

ctgggctgcctggtcaaggactacttccccgaaccggtgacggtgtcgtg 1550 

gaactcaggcgccctgaccagcggcgtgcacaccttcccggctgtcctac 160 0 

agtcctcaggactctactccctcagcagcgtggtgaccgtgccctccagc 165 0 

agcttgggcacccagacctacatctgcaacgtgaatcacaagcccagcaa 170 0 

caccaaggtggacaagaaagttgagcccaaatcttgtgacaaaactcaca 175 0 

catgcccaccgtgcccagcacctgaactcctggggggaccgtcagtcttc 18 0 0 

ctcttccccccaaaacccaaggacaccctcatgatctcccggacccctga 1850 

ggtcacatgcgtggtggtggacgtgagccacgaagaccctgaggtcaagt 19 0 0 

tcaactggtacgtggacggcgtggaggtgcataatgccaagacaaagccg 195 0 

cgggaggagcagtacaacagcacgtaccgggtggtcagcgtcctcaccgt 2 0 00 

cctgcaccaggactggctgaatggcaaggagtacaagtgcaaggtctcca 2 05 0 
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FIGURE lOB 

acaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggg 210 0 

cagccccgagaaccacaggtgtacaccctgcccccatcccgggatgagct 215 0 

gaccaagaaccaggtcagcctgacc tgcc tggtcaaaggc ttc tatccca 22 0 0 

gcgacatcgccgtggagtgggagagcaatgggcagccggagaacaactac 22 5 0 

aagaccacgcctcccgtgctggactccgacggctccttcttcctctacag 23 0 0 

caagctcaccgtggacaagagcaggtggcagcaggggaacgtc ttc teat 23 50 

gctccgtgatgcatgaggctctgcacaaccactacacgcagaagagcctc 240 0 

tccctgtctccgggtaaatgatagatatct 
S P G K * 
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FIGURE 11 

EcoRI 

gaattc tgagca 1000 

cacaggacctcaccatgggatggagctgtatcatcctcttcttggtagca 105 0 

MGWSCIILFLVA 

acagctacaggtgtccactcccagtctgtgttgacgcagccgccc tcagt 110 0 
T A T G V H S Q S V L T Q - 

N- term 

ctctgcggccccaggacagaaggtcaccatctcctgcac tgggagcagct 115 0 

ccaacctcggggcaggttatgatgttcactggtaccggcaacttccaggg 12 0 0 

acagcccc caaac tec teat ctatgataacaacaatcggcc etc aggggt 12 5 0 

ccctgaccgattctctggctccaagtctggcccc tcagcctccctggeca 13 0 0 

tctctgggctccaggctgaggatgaggctgattattactgccagtcctat 13 5 0 

gacagcagcctgaatggttatgtctteggaactgggacccagctcaccgt 140 0 
Avrll 

cc_taggteagcccaaggctgccccctcggtcaetetgttcecgccctcct 145 0 

ctgaggagcttcaagceaacaaggccacactggtgtgtctcataagtgac 15 0 0 

ttctacccgggagccgtgacagtggcctggaaggcaattagcagccccgt 155 0 

caaggegggagtggagaccaccacaccctccaaacaaagcaacaacaagt 160 0 

aegcggceagcagctatctgagcctgacgeetgagcagtggaagtcccac 165 0 

agaaggtacagctgccaggtcacgcatgaagggagcaccgtggagaagac 17 0 0 

a-gtggcccctacagaatgttca tag ttctagatctacgtatgatcagcct 17 5 0 

P T E C S * 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: SmithKline Beechara, PLC 
(ii) TITLE OF INVENTION: Human Monoclonal Antibody 
(iii) NUMBER OF SEQUENCES: 21 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham Corporation 

(B) STREET: 7 09 Swedeland Road 

(C) CITY: King of Prussia 

( D ) STATE : PA 

(E) COUNTRY: USA 

(F) ZIP: 19406-2799 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 

(B) FILING DATE: 

{ C ) CLASSIFICATION : 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: King, William T. 

(B) REGISTRATION NUMBER: 30,954 

(C) REFERENCE /DOCKET NUMBER: # 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-4800 

(B) TELEFAX: 610-270-4026 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .336 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CAG TCT GTG TTG ACG CAG CCG CCC TCA GTC TCT GCG GCC CCA GGA CAG 4 8 
Gin Ser Val Leu Thr Gin Pro Pro Ser Val Ser Ala Ala Pro Gly Gin 
15 10 15 

AAG GTC ACC ATC TCC TGC ACT GGG AGC AGC TCC AAC CTC GGG GCA GGT 9 6 



wo 00/69462 2 PCT/USOO/13694 

Lys Val Thr lie Ser Cys Thr Gly Ser Ser Ser Asn Leu Gly Ala Gly 

2 0 2 5 3 0 

TAT GAT GTT CAC TGG TAG CGG CAA CTT CCA GGG ACA GCC CCC AAA CTC 144 

Tyr Asp Val His Trp Tyr Arg Gin Leu Pro Gly Thr Ala Pro Lys Leu 

35 40 45 

CTC ATC TAT GAT AAC AAC AAT CGG CCC TCA GGG GTC CCT GAC CGA TTC 192 

Leu lie Tyr Asp Asn Asn Asn Arg Pro Ser Gly Val Pro Asp Arg Phe 

50 55 60 

TCT GGC TCC AAG TCT GGC CCC TCA GCC TCC CTG GCC ATC TCT GGG CTC 24 0 

Ser Gly Ser Lys Ser Gly Pro Ser Ala Ser Leu Ala lie Ser Gly Leu 

65 70 75 80 

CAG GCT GAG GAT GAG GCT GAT TAT TAG TGC CAG TCC TAT GAC AGC AGC 2 88 

Gin Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gin Ser Tyr Asp Ser Ser 

85 90 95 

CTG AAT GGT TAT GTC TTC GGA ACT GGG ACC CAG CTC ACC GTC CTA GGT 33 6 

Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gin Leu Thr Val Leu Gly 

100 105 110 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Gin Ser Val Leu Thr Gin Pro Pro Ser Val Ser Ala Ala Pro Gly Gin 
15 10 15 

Lys Val Thr lie Ser Cys Thr Gly Ser Ser Ser Asn Leu Gly Ala Gly 

20 25 30 

Tyr Asp Val His Trp Tyr Arg Gin Leu Pro Gly Thr Ala Pro Lys Leu 

35 40 45 

Leu lie Tyr Asp Asn Asn Asn Arg Pro Ser Gly Val Pro Asp Arg Phe 
50 55 60 

Ser Gly Ser Lys Ser Gly Pro Ser Ala Ser Leu Ala lie Ser Gly Leu 
65 70 75 80 

Gin Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gin Ser Tyr Asp Ser Ser 

85 90 95 

Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gin Leu Thr Val Leu Gly 

100 105 110 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..3 57 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GAG GTG CAG CTG GTG GAG TCT GGG GGA GGC TTG GTA CAG CCT GGG GGG 4 8 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 
15 10 15 

TCC CTG AGA CTC TCC TGC GCA GCC TCT GGA GTC TCC CTC AGT GGA TAC 96 
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu Ser Gly Tyr 

20 25 30 

AAG ATG AAC TGG GTC CGC CAG GCT CCA GGG AAG GGG CTG GAA TGG GTC 144 
Lys Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 

35 40 45 

TCT TCC ATT ACT GGT ATG AGT AAT TAC ATA CAC TAC TCA GAC TCA GTG 192 
Ser Ser lie Thr Gly Met Ser Asn Tyr lie His Tyr Ser Asp Ser Val 
50 55 60 

AAG GGC CGA TTC ACC ATC TCC AGA GAC AAC GCC ATG AAC TCA CTG TAT 240 
Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ala Met Asn Ser Leu Tyr 
65 70 75 80 

CTG CAA ATG AAC AGC CTG ACA GCC GAG GAC ACG GGT GTT TAT TAT TGT 2 88 

Leu Gin Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val Tyr Tyr Cys 

85 90 95 

GCG ACA CAA CCG GGG GAG CTG GCG CCT TTT GAC CAT TGG GGC CAG GGA 33 6 

Ala Thr Gin Pro Gly Glu Leu Ala Pro Phe Asp His Trp Gly Gin Gly 

100 105 110 

ACC CTG GTC ACC GTC TCC TCA 3 57 

Thr Leu Val Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 

15 10 15 

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu Ser Gly Tyr 

20 25 30 



Lys Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 
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45 



Ser Ser lie Thr Gly Met Ser Asn Tyr lie His Tyr Ser Aso Ser Val 
50 55 60 

Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ala Met Asn Ser Leu Tyr 
65 70 75 80 

Leu Gin Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val Tyr Tyr Cys 

85 90 ' 95 

Ala Thr Gin Pro Gly Glu Leu Ala Pro Phe Asp His Trp Gly Gin Gly 

100 105 110 

Thr Leu Val Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 
( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 
1 5 10 15 * 

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu Ser Gly T\'r 

20 25 30 

Lys Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 
35 40 45 

Ser Ser lie Thr Gly Met Ser Asn Tyr lie His Tyr Ser Asp Ser Val 

50 55 60 

Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ala Met Asn Ser Leu Tyr 
65 70 75 80 

Leu Gin Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val Tyr Tyr Cys 

85 90 95 

Ala Thr Gin Pro Gly Glu Leu Ala Pro Phe Asp His Trp Gly Gin Gly 

100 105 ' 110 

Thr Leu Val Thr Val Ser Ser 
115 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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{ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 
15 10 15 

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Ser Tyr 

20 25 30 

Glu Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 
35 40 45 

Ser Tyr lie Ser Ser Ser Gly Ser Thr lie Tyr Tyr Ala Asp Ser Val 
50 55 60 

Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr 

65 70 75 80 

Leu Gin Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys 

85 90 95 

Ala Arg 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DE 

Met Gly Trp Ser 
1 

Val His Ser Glu 

20 

Pro Gly Gly Ser 
35 

Ser Gly Tyr Lys 
50 

Glu Trp Val Ser 
65 

Asp Ser Val Lys 



ICRIPTION: SEQ I 

Cys lie lie Leu 
5 

Val Gin Leu Leu 



Leu Arg Leu Ser 

40 

Met Asn Trp Val 
55 

Ser lie Thr Gly 
70 

Gly Arg Phe Thr 
85 



I NO : 7 : 

Phe Leu Val Ala 
10 

Glu Ser Gly Gly 
25 

Cys Ala Ala Ser 



Arg Gin Ala Pro 

60 

Met Ser Asn Tyr 
75 

lie Ser Arg Asp 
90 



Thr Ala Thr Gly 
15 

Gly Leu Val Gin 
30 

Gly Val Ser Leu 
45 

Gly Lys Gly Leu 



lie His Tyr Ser 

80 

Asn Ala Met Asn 
95 



Ser Leu Tyr Leu Gin Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val 
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110 



Tyr Tyr Cys Ala Thr Gin Pro Gly Glu Leu Ala Pro Phe Asp His Trp 

115 120 125 

Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
130 135 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 8 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Gly Trp Ser Cys lie lie Leu Phe Leu Val Ala Thr Ala Thr Gly 
15 10 15 

Val His Ser Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin 

20 25 30 

Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu 
35 40 45 

Ser Gly Tyr Lys Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu 
50 55 60 

Glu Trp Val Ser Ser lie Thr Gly Met Ser Asn Tyr lie His Tyr Ser 
65 70 75 80 

Asp Ser Val Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ala Met Asn 

85 90 95 

Ser Leu Tyr Leu Gin Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val 

100 105 110 

Tyr Tyr Cys Ala Thr Gin Pro Gly Glu Leu Ala Pro Phe Asp His Trp 
115 120 125 

Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
130 135 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TO POLOG Y : unknown 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
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Gin Ser Val Leu Thr Gin Pro Pro Ser Val Ser Ala Ala Pro Gly Gin 

15 10 15 

Lys Val Thr lie Ser Cys Thr Gly Ser Ser Ser Asn Leu Gly Ala Gly 

20 25 30 

Tyr Asp Val His Trp Tyr Arg Gin Leu Pro Gly Thr Ala Pro Lys Leu 
35 40 45 

Leu lie Tyr Asp Asn Asn Asn Arg Pro Ser Gly Val Pro Asp Arg Phe 
50 55 60 

Ser Gly Ser Lys Ser Gly Pro Ser Ala Ser Leu Ala lie Ser Gly Leu 
65 70 75 80 

Gin Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gin Ser Tyr Asp Ser Ser 

85 90 95 

Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gin Leu Thr Val Leu 

100 105 110 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gin Ser Val Leu Thr Gin Pro Pro Ser Val Ser Gly Ala Pro Gly Gin 
15 10 15 

Arg Val Thr lie Ser Cys Thr Gly Ser Ser Ser Asn He Gly Ala Gly 

20 25 30 

Tyr Asp Val His Trp Tyr Gin Gin Leu Pro Gly Thr Ala Pro Lys Leu 
35 40 45 

Leu He Tyr Gly Asn Ser Asn Arg Pro Ser Gly Val Pro Asp Arg Phe 
50 55 60 

Ser Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala He Thr Gly Leu 
65 70 75 80 

Gin Ala Glu Asp Glu Ala Asp Tyr Tyr Cys 

85 90 

(2) INFORMATION FOR SEQ ID NO ill: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Gly Trp Ser Cys lie lie Leu Phe Leu Val Ala Thr Ala Thr Gly 
15 10 15 

Val His Ser Glu Leu Thr Gin Pro Pro Ser Val Ser Gly Ala Pro Gly 

20 25 30 

Gin Arg Val Thr lie Ser Cys Thr Gly Ser Ser Ser Asn lie Gly Ala 
35 40 45 

Gly Tyr Asp Val His Trp Tyr Gin Gin Leu Pro Gly Thr Ala Pro Lys 
50 55 60 

Leu Leu lie Tyr Gly Asn Ser Asn Arg Pro Ser Gly Val Pro Asp Arg 
65 70 75 80 

Phe Ser Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala lie Thr Gly 

85 90 95 

Leu Gin Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gin Ser Tyr Asp Ser 

100 105 110 

Ser Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gin Leu Thr Val Leu 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 0 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DE 

Met Gly Trp Ser 
1 

Val His Ser Gin 

20 

Pro Gly Gin Arg 
35 

Gly Ala Gly Tyr 
50 

Pro Lys Leu Leu 
65 



CRIPTION: SEQ I 

Cys lie lie Leu 
5 

Ser Val Leu Thr 



Val Thr lie Ser 

40 

Asp Val His Trp 
55 

lie Tyr Gly Asn 
70 



) N0:12 : 

Phe Leu Val Ala 
10 

Gin Pro Pro Ser 
25 - 

Cys Thr Gly Ser 



Tyr Gin Gin Leu 

60 

Ser Asn Arg Pro 
75 



Thr Ala Thr Gly 
15 

Val Ser Gly Ala 
30 

Ser Ser Asn lie 
45 

Pro Gly Thr Ala 



Ser Gly Val Pro 

80 
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Asp Arg Phe Ser Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala lie 

85 90 95 

Thr Gly Leu Gin Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gin Ser Tyr 

100 105 110 

Asp Ser Ser Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gin Leu Thr 
115 120 125 

Val Leu 
130 

(2) INFORMATION FOR SEQ ID N0:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 6 0 

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 12 0 

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180 

ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 2 40 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 3 00 

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCGG 3 60 

GGATCGATCC GTCGACGTAC GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 42 0 

CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 480 

CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 54 0 

ATAGGGACTT TCCATTGACG TCAATGGGTG GACTATTTAC GGTAAACTGC CCACTTGGCA 6 00 

GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 660 

CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 72 0 

TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 7 80 

GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 840 

TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 90 0 

ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC TGGGTACGTG 9 60 

AACCGTCAGA TCGCCTGGAG ACGCCATCGA ATTCTGAGCA CACAGGACCT CACCATGGGA 102 0 

TGGAGCTGTA TCATCCTCTT CTTGGTAGCA ACAGCTACAG GTGTCCACTC CGAGGTCCAA 10 8 0 
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CTGCTCGAGT CTGGGGGAGG CTTGGTACAG CCTGGGGGGT CCCTGAGACT CTCCTGCGCA 114 0 

GCCTCTGGAG TCTCCCTCAG TGGATACAAG ATGAACTGGG TCCGCCAGGC TCCAGGGAAG 12 0 0 

GGGCTGGAAT GGGTCTCTTC CATTACTGGT ATGAGTAATT ACATACACTA CTCAGACTCA 12 6 0 

GTGAAGGGCC GATTCACCAT CTCCAGAGAC AACGCCATGA ACTCACTGTA TCTGCAAATG 132 0 

AACAGCCTGA CAGCCGAGGA CACGGGTGTT TATTATTGTG CGACACAACC GGGGGAGCTG 13 8 0 

GCGCCTTTTG ACCATTGGGG CCAGGGAACC CTGGTCACCG TCTCCTCAGC CTCCACCAAG 144 0 

GGCCCATCGG TCTTCCCCCT GGCACCCTCC TCCAAGAGCA CCTCTGGGGG CACAGCGGCC 15 0 0 

CTGGGCTGCC TGGTCAAGGA CTACTTCCCC GAACCGGTGA CGGTGTCGTG GAACTCAGGC 15 60 

GCCCTGACCA GCGGCGTGCA CACCTTCCCG GCTGTCCTAC AGTCCTCAGG ACTCTACTCC 162 0 

CTCAGCAGCG TGGTGACCGT GCCCTCCAGC AGCTTGGGCA CCCAGACCTA CATCTGCAAC 168 0 

GTGAATCACA AGCCCAGCAA CACCAAGGTG GACAAGAAAG TTGAGCCCAA ATCTTGTGAC 174 0 

AAAACTCACA CATGCCCACC GTGCCCAGCA CCTGAACTCC TGGGGGGACC GTCAGTCTTC 18 00 

CTCTTCCCCC CAAAACCCAA GGACACCCTC ATGATCTCCC GGACCCCTGA GGTCACATGC 186 0 

GTGGTGGTGG ACGTGAGCCA CGAAGACCCT GAGGTCAAGT TCAACTGGTA CGTGGACGGC 192 0 

GTGGAGGTGC ATAATGCCAA GACAAAGCCG CGGGAGGAGC AGTACAACAG CACGTACCGG 198 0 

GTGGTCAGCG TCCTCACCGT CCTGCACCAG GACTGGCTGA ATGGCAAGGA GTACAAGTGC 2 040 

AAGGTCTCCA ACAAAGCCCT CCCAGCCCCC ATCGAGAAAA CCATCTCCAA AGCCAAAGGG 2100 

CAGCCCCGAG AACCACAGGT GTACACCCTG CCCCCATCCC GGGATGAGCT GACCAAGAAC 216 0 

CAGGTCAGCC TGACCTGCCT GGTCAAAGGC TTCTATCCCA GCGACATCGC CGTGGAGTGG 2 22 0 

GAGAGCAATG GGCAGCCGGA GAACAACTAC AAGACCACGC CTCCCGTGCT GGACTCCGAC 22 80 

GGCTCCTTCT TCCTCTACAG CAAGCTCACC GTGGACAAGA GCAGGTGGCA GCAGGGGAAC 23 4 0 

GTCTTCTCAT GCTCCGTGAT GCATGAGGCT CTGCACAACC ACTACACGCA GAAGAGCCTC 24 0 0 

TCCCTGTCTC CGGGTAAATG ATAGATATCT ACGTATGATC AGCCTCGACT GTGCCTTCTA 246 0 

GTTGCCAGCC ATCTGTTGTT TGCCCCTCCC CCGTGCCTTC CTTGACCCTG GAAGGTGCCA 2 52 0 

CTCCCACTGT CCTTTCCTAA TAAAATGAGG AAATTGCATC GCATTGTCTG AGTAGGTGTC 2 58 0 

ATTCTATTCT GGGGGGTGGG GTGGGGCAGG ACAGCAAGGG GGAGGATTGG GAAGACAATA 2 64 0 

GCAGGCATGC TGGGGATGCG GTGGGCTCTA TGGAACCAGC TGGGGCTCGA CAGCGCTGGA 27 00 

TCTCCCGATC CCCAGCTTTG CTTCTCAATT TCTTATTTGC ATAATGAGAA AAAAAGGAAA 276 0 

ATTAATTTTA ACACCAATTC AGTAGTTGAT TGAGCAAATG CGTTGCCAAA AAGGATGCTT 2 82 0 

TAGAGACAGT GTTCTCTGCA CAGATAAGGA CAAACATTAT TCAGAGGGAG TACCCAGAGC 2 880 

TGAGACTCCT AAGCCAGTGA GTGGCACAGC ATTCTAGGGA GAAATATGCT TGTCATCACC 2 94 0 

GAAGCCTGAT TCCGTAGAGC CACACCTTGG TAAGGGCCAA TCTGCTCACA CAGGATAGAG 3 000 
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AGGGCAGGAG CCAGGGCAGA GCATATAAGG TGAGGTAGGA TCAGTTGCTC CTCACATTTG 3 06 0 

CTTCTGACAT AGTTGTGTTG GGAGCTTGGA TAGCTTGGAC AGCTCAGGGC TGCGATTTCG 312 0 

CGCCAAACTT GACGGCAATC CTAGCGTGAA GGCTGGTAGG ATTTTATCCC CGCTGCCATC 318 0 

ATGGTTCGAC CATTGAACTG CATCGTCGCC GTGTCCCAAA ATATGGGGAT TGGCAAGAAC 3 24 0 

GGAGACCTAC CCTGGCCTCC GCTCAGGAAC GAGTTCAAGT ACTTCCAAAG AATGACCACA 3 30 0 

ACCTCTTCAG TGGAAGGTAA ACAGAATCTG GTGATTATGG GTAGGAAAAC CTGGTTCTCC 3 3 60 

ATTCCTGAGA AGAATCGACC TTTAAAGGAC AGAATTAATA TAGTTCTCAG TAGAGAACTC 3 42 0 

AAAGAACCAC CACGAGGAGC TCATTTTCTT GCCAAAAGTT TGGATGATGC CTTAAGACTT 3480 

ATTGAACAAC CGGAATTGGC AAGTAAAGTA GACATGGTTT GGATAGTCGG AGGCAGTTCT 3 54 0 

GTTTACCAGG AAGCCATGAA TCAACCAGGC CACCTTAGAC TCTTTGTGAC AAGGATCATG 3 60 0 

CAGGAATTTG AAAGTGACAC GTTTTTCCCA GAAATTGATT TGGGGAAATA TAAACTTCTC 3 660 

CCAGAATACC CAGGCGTCCT CTCTGAGGTC CAGGAGGAAA AAGGCATCAA GTATAAGTTT 3 72 0 

GAAGTCTACG AGAAGAAAGA CTAACAGGAA GATGCTTTCA AGTTCTCTGC TCCCCTCCTA 37 8 0 

AAGCTATGCA TTTTTATAAG ACCATGGGAC TTTTGCTGGC TTTAGATCAG CCTCGACTGT 3 840 

GCCTTCTAGT TGCCAGCCAT CTGTTGTTTG CCCCTCCCCC GTGCCTTCCT TGACCCTGGA 3 90 0 

AGGTGCCACT CCCACTGTCC TTTCCTAATA AAATGAGGAA ATTGCATCGC ATTGTCTGAG 3 9 60 

TAGGTGTCAT TCTATTCTGG GGGGTGGGGT GGGGCAGGAC AGCAAGGGGG AGGATTGGGA 4 02 0 

AGACAATAGC AGGCATGCTG GGGATGCGGT GGGCTCTATG GAACCAGCTG GGGCTCGATC 40 8 0 

GAGTGTATGA CTGCGGCCGC GATCCCGTCG AGAGCTTGGC GTAATCATGG TCATAGCTGT 4140 

TTCCTGTGTG AAATTGTTAT CCGCTCACAA TTCCACACAA CATACGAGCC GGAAGCATAA 420 0 

AGTGTAAAGC CTGGGGTGCC TAATGAGTGA GCTAACTCAC ATTAATTGCG TTGCGCTCAC 42 6 0 

TGCCCGCTTT CCAGTCGGGA AACCTGTCGT GCCAGCTGCA TTAATGAATC GGCCAACGCG 43 2 0 

CGGGGAGAGG CGGTTTGCGT ATTGGGCGCT CTTCCGCTTC CTCGCTCACT GACTCGCTGC 43 8 0 

GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT CAGCTCACTC AAAGGCGGTA ATACGGTTAT 444 0 

CCACAGAATC AGGGGATAAC GCAGGAAAGA ACATGTGAGC AAAAGGCCAG CAAAAGGCCA 45 00 

GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG GCTCCGCCCC CCTGACGAGC 45 6 0 

ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC GACAGGACTA TAAAGATACC 4 62 0 

AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG 46 80 

GATACCTGTC CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT TTCTCAATGC TCACGCTGTA 474 0 

GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC GAACCCCCCG 48 00 

TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT TGAGTCCAAC CCGGTAAGAC 4 86 0 
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ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT TAGCAGAGCG AGGTATGTAG 492 0 

GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC CTAACTACGG CTACACTAGA AGGACAGTAT 49 8 0 

TTGGTATCTG CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT 5 040 

CCGGCAAACA AACCACCGCT GGTAGCGGTG GTTTTTTTGT TTGCAAGCAG CAGATTACGC 510 0 

GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC TACGGGGTCT GACGCTCAGT 5160 

GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT ATCAAAAAGG ATCTTCACCT 522 0 

AGATCCTTTT AAATTAAAAA TGAAGTTTTA AATCAATCTA AAGTATATAT GAGTAAACTT 52 8 0 

GGTCTGACAG TTACCAATGC TTAATCAGTG AGGCACCTAT CTCAGCGATC TGTCTATTTC 53 40 

GTTCATCCAT AGTTGCCTGA CTCCCCGTCG TGTAGATAAC TACGATACGG GAGGGCTTAC 54 0 0 

CATCTGGCCC CAGTGCTGCA ATGATACCGC GAGACCCACG CTCACCGGCT CCAGATTTAT 54 60 

CAGCAATAAA CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG TGGTCCTGCA ACTTTATCCG 55 2 0 

CCTCCATCCA GTCTATTAAT TGTTGCCGGG AAGCTAGAGT AAGTAGTTCG CCAGTTAATA 55 80 

GTTTGCGCAA CGTTGTTGCC ATTGCTACAG GCATCGTGGT GTCACGCTCG TCGTTTGGTA 5 64 0 

TGGCTTCATT CAGCTCCGGT TCCCAACGAT CAAGGCGAGT TACATGATCC CCCATGTTGT 57 0 0 

GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC CGATCGTTGT CAGAAGTAAG TTGGCCGCAG 57 60 

TGTTATCACT CATGGTTATG GCAGCACTGC ATAATTCTCT TACTGTCATG CCATCCGTAA 5 82 0 

GATGCTTTTC TGTGACTGGT GAGTACTCAA CCAAGTCATT CTGAGAATAG TGTATGCGGC 58 80 

GACCGAGTTG CTCTTGCCCG GCGTCAATAC GGGATAATAC CGCGCCACAT AGCAGAACTT 5 940 

TAAAAGTGCT CATCATTGGA AAACGTTCTT CGGGGCGAAA ACTCTCAAGG ATCTTACCGC 60 0 0 

TGTTGAGATC CAGTTCGATG TAACCCACTC GTGCACCCAA CTGATCTTCA GCATCTTTTA 606 0 

CTTTCACCAG CGTTTCTGGG TGAGCAAAAA CAGGAAGGCA AAATGCCGCA AAAAAGGGAA 612 0 

TAAGGGCGAC ACGGAAATGT TGAATACTCA TACTCTTCCT TTTTCAATAT TATTGAAGCA 6180 

TTTATCAGGG TTATTGTCTC ATGAGCGGAT ACATATTTGA ATGTATTTAG AAAAATAAAC 62 4 0 

AAATAGGGGT TCCGCGCACA TTTCCCCGAA AAGTGCCACC T 62 81 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5679 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 6 0 

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 12 0 

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 18 0 

ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 24 0 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 3 00 

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCGG 3 60 

GGATCGATCC GTCGACGTAC GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 42 0 

CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 480 

CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 54 0 

ATAGGGACTT TCCATTGACG TCAATGGGTG GACTATTTAC GGTAAACTGC CCACTTGGCA 6 00 

GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 66 0 

CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 72 0 

TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 780 

GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 84 0 

TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 900 

ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC TGGGTACGTG 9 60 

AACCGTCAGA TCGCCTGGAG ACGCCATCGA ATTCTGAGCA CACAGGACCT CACCATGGGA 102 0 

TGGAGCTGTA TCATCCTCTT CTTGGTAGCA ACAGCTACAG GTGTCCACTC CGAGCTCACG 1080 

CAGCCGCCCT CAGTCTCTGC GGCCCCAGGA CAGAAGGTCA CCATCTCCTG CACTGGGAGC 114 0 

AGCTCCAACC TCGGGGCAGG TTATGATGTT CACTGGTACC GGCAACTTCC AGGGACAGCC 12 0 0 

CCCAAACTCC TCATCTATGA TAACAACAAT CGGCCCTCAG GGGTCCCTGA CCGATTCTCT 12 60 

GGCTCCAAGT CTGGCCCCTC AGCCTCCCTG GCCATCTCTG GGCTCCAGGC TGAGGATGAG 13 2 0 

GCTGATTATT ACTGCCAGTC CTATGACAGC AGCCTGAATG GTTATGTCTT CGGAACTGGG 13 80 

ACCCAGCTCA CCGTCCTAGG TCAGCCCAAG GCTGCCCCCT CGGTCACTCT GTTCCCGCCC 1440 

TCCTCTGAGG AGCTTCAAGC CAACAAGGCC ACACTGGTGT GTCTCATAAG TGACTTCTAC 1500 

CCGGGAGCCG TGACAGTGGC CTGGAAGGCA ATTAGCAGCC CCGTCAAGGC GGGAGTGGAG 15 6 0 

ACCACCACAC CCTCCAAACA AAGCAACAAC AAGTACGCGG CCAGCAGCTA TCTGAGCCTG 162 0 

ACGCCTGAGC AGTGGAAGTC CCACAGAAGG TACAGCTGCC AGGTCACGCA TGAAGGGAGC 16 80 

ACCGTGGAGA AGACAGTGGC CCCTACAGAA TGTTCATAGT TCTAGATCTA CGTATGATCA 1740 

GCCTCGACTG TGCCTTCTAG TTGCCAGCCA TCTGTTGTTT GCCCCTCCCC CGTGCCTTCC 180 0 

TTGACCCTGG AAGGTGCCAC TCCCACTGTC CTTTCCTAAT AAAATGAGGA AATTGCATCG 186 0 

CATTGTCTGA GTAGGTGTCA TTCTATTCTG GGGGGTGGGG TGGGGCAGGA CAGCAAGGGG 192 0 
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GAGGATTGGG AAGACAATAG CAGGCATGCT GGGGATGCGG TGGGCTCTAT GGAACCAGCT 19 8 0 

GGGGCTCGAC AGCTCGAGCT AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA 2 04 0 

AAGGAAAATT AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG 210 0 

GATGCTTTAG AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC 216 0 

CCAGAGCTGA GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT 222 0 

CATCACCGAA GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG 22 8 0 

GATAGAGAGG GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC 23 4 0 

ACATTTGCTT CTGACATAGT TGTGTTGGGA GCTTGGATCG ATCCACCATG GTTGAACAAG 2 40 0 

ATGGATTGCA CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG 246 0 

CACAACAGAC AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC 2 52 0 

CGGTTCTTTT TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGT^CTGCAG GACGAGGCAG 25 8 0 

CGCGGCTATC GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA 2 64 0 

CTGAAGCGGG AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT 270 0 

CTCACCTTGC TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG CGGCTGCATA 27 60 

CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC GAGCGAGCAC 2 82 0 

GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG CATCAGGGGC 2 88 0 

TCGCGCCAGC CGAACTGTTC GCCAGGCTCA AGGCGCGCAT GCCCGACGGC GAGGATCTCG 294 0 

TCGTGACCCA TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC CGCTTTTCTG 3 000 

GATTCATCGA CTGTGGCCGG CTGGGTGTGG CGGACCGCTA TCAGGACATA GCGTTGGCTA 3 060 

CCCGTGATAT TGCTGAAGAG CTTGGCGGCG AATGGGCTGA CCGCTTCCTC GTGCTTTACG 312 0 

GTATCGCCGC TCCCGATTCG CAGCGCATCG CCTTCTATCG CCTTCTTGAC GAGTTCTTCT 3180 

GAGCGGGACT CTGGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC CATCACGAGA 324 0 

TTTCGATTCC ACCGCCGCCT TCTATGAAAG GTTGGGCTTC GGAATCGTTT TCCGGGACGC 33 00 

CGGCTGGATG ATCCTCCAGC GCGGGGATCT CATGCTGGAG TTCTTCGCCC ACCCCAACTT 33 6 0 

GTTTATTGCA GCTTATAATG GTTACAAATA AAGCAATAGC ATCACAAATT TCACAAATAA 342 0 

AGCATTTTTT TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATG TATCTTATCA 3 48 0 

TGTCTGGATC GCGGCCGCGA TCCCGTCGAG AGCTTGGCGT AATCATGGTC ATAGCTGTTT 3 54 0 

CCTGTGTGAA ATTGTTATCC GCTCACAATT CCACACAACA TACGAGCCGG AAGCATAAAG 3 600 

TGTAAAGCCT GGGGTGCCTA ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG 3 660 

CCCGCTTTCC AGTCGGGAAA CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG 3 72 0 

GGGAGAGGCG GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC 37 8 0 
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TCGGTCGTTC GGCTGCGGCG AGCGGTATCA GCTCACTCAA 

ACAGAATCAG GGGATAACGC AGGAAAGAAC ATGTGAGCAA 

AACCGTAAAA AGGCCGCGTT GCTGGCGTTT TTCCATAGGC 

CACAAAAATC GACGCTCAAG TCAGAGGTGG CGAAACCCGA 

GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC 

TACCTGTCCG CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT 

TATCTCAGTT CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT 

CAGCCCGACC GCTGCGCCTT ATCCGGTAAC TATCGTCTTG 

GACTTATCGC CACTGGCAGC AGCCACTGGT AACAGGATTA 

GGTGCTACAG AGTTCTTGAA GTGGTGGCCT AACTACGGCT 

GGTATCTGCG CTCTGCTGAA GCCAGTTACC TTCGGAAAAA 

GGCAAACAAA CCACCGCTGG TAGCGGTGGT TTTTTTGTTT 

AGAAAAAAAG GATCTCAAGA AGATCCTTTG ATCTTTTCTA 

AACGAAAACT CACGTTAAGG GATTTTGGTC ATGAGATTAT 

ATCCTTTTAA ATTAAAAATG AAGTTTTAAA TCAATCTAAA 

TCTGACAGTT ACCAATGCTT AATCAGTGAG GCACCTATCT 

TCATCCATAG TTGCCTGACT CCCCGTCGTG TAGATAACTA 

TCTGGCCCCA GTGCTGCAAT GATACCGCGA GACCCACGCT 

GCAATAAACC AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG 

TCCATCCAGT CTATTAATTG TTGCCGGGAA GCTAGAGTAA 

TTGCGCAACG TTGTTGCCAT TGCTACAGGC ATCGTGGTGT 

GCTTCATTCA GCTCCGGTTC CCAACGATCA AGGCGAGTTA 

AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA 

TTATCACTCA TGGTTATGGC AGCACTGCAT AATTCTCTTA 

TGCTTTTCTG TGACTGGTGA GTACTCAACC AAGTCATTCT 

CCGAGTTGCT CTTGCCCGGC GTCAATACGG GATAATACCG 

AAAGTGCTCA TCATTGGAAA ACGTTCTTCG GGGCGAAAAC 

TTGAGATCCA GTTCGATGTA ACCCACTCGT GCACCCAACT 

TTCACCAGCG TTTCTGGGTG AGCAAAAACA GGAAGGCAAA 

AGGGCGACAC GGAAATGTTG AATACTCATA CTCTTCCTTT 

TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT 

ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCT 



PCT/USOO/13694 

AGGCGGTAAT ACGGTTATCC 3 84 0 

AAGGCCAGCA AAAGGCCAGG 3 9 00 

TCCGCCCCCC TGACGAGCAT 3 9 60 

CAGGACTATA AAGATACCAG 402 0 

CGACCCTGCC GCTTACCGGA 40 80 

CTCAATGCTC ACGCTGTAGG 414 0 

GTGTGCACGA ACCCCCCGTT 42 0 0 

AGTCCAACCC GGTAAGACAC 42 60 

GCAGAGCGAG GTATGTAGGC 43 2 0 

ACACTAGAAG GACAGTATTT 43 8 0 

GAGTTGGTAG CTCTTGATCC 4440 

GCAAGCAGCA GATTACGCGC 45 0 0 

CGGGGTCTGA CGCTCAGTGG 45 6 0 

CAAAAAGGAT CTTCACCTAG 4 62 0 

GTATATATGA GTAAACTTGG 4 680 

CAGCGATCTG TCTATTTCGT 474 0 

CGATACGGGA GGGCTTACCA 4 8 00 

CACCGGCTCC AGATTTATCA 4860 

GTCCTGCAAC TTTATCCGCC 492 0 

GTAGTTCGCC AGTTAATAGT 4 980 

CACGCTCGTC GTTTGGTATG 5 04 0 

CATGATCCCC CATGTTGTGC 5100 

GAAGTAAGTT GGCCGCAGTG 5160 

CTGTCATGCC ATCCGTAAGA 522 0 

GAGAATAGTG TATGCGGCGA 5280 

CGCCACATAG CAGAACTTTA 53 4 0 

TCTCAAGGAT CTTACCGCTG 54 00 

GATCTTCAGC ATCTTTTACT 54 6 0 

ATGCCGCAAA AAAGGGAATA 552 0 

TTCAATATTA TTGAAGCATT 55 80 

GTATTTAGAA AAATAAACAA 5 64 0 

5679 
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(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GAATTCTGAG CACACAGGAC CTCACCATGG GATGGAGCTG TATCATCCTC TTCTTGGTAG 6 0 

CAACAGCTAC AGGTGTCCAC TCCGAGGTGC AGCTGGTGGA GTCTGGGGGA GGCTTGGTAC 12 0 

AGCCTGGGGG GTCCCTGAGA CTCTCCTGCG CAGCCTCTGG AGTCTCCCTC AGTGGATACA 180 

AGATGAACTG GGTCCGCCAG GCTCCAGGGA AGGGGCTGGA ATGGGTCTCT TCCATTACTG 240 

GTATGAGTAA TTACATACAC TACTCAGACT CAGTGAAGGG CCGATTCACC ATCTCCAGAG 3 00 

ACAACGCCAT GAACTCACTG TATCTGCAAA TGAACAGCCT GACAGCCGAG GACACGGGTG 3 60 

TTTATTATTG TGCGACACAA CCGGGGGAGC TGGCGCCTTT TGACCATTGG GGCCAGGGAA 420 

CCCTGGTCAC CGTCTCCTCA GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT 480 

CCTCCAAGAG CACCTCTGGG GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC 54 0 

CCGAACCGGT GACGGTGTCG TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC 600 

CGGCTGTCCT ACAGTCCTCA GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA 6 60 

GCAGCTTGGG CACCCAGACC TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG 72 0 

TGGACAAGAA AGTTGAGCCC AAATCTTGTG ACAAAACTCA CACATGCCCA CCGTGCCCAG 7 80 

CACCTGAACT CCTGGGGGGA CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC 84 0 

TCATGATCTC CCGGACCCCT GAGGTCACAT GCGTGGTGGT GGACGTGAGC CACGAAGACC 9 00 

CTGAGGTCAA GTTCAACTGG TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC 9 60 

CGCGGGAGGA GCAGTACAAC AGCACGTACC GGGTGGTCAG CGTCCTCACC GTCCTGCACC 102 0 

AGGACTGGCT GAATGGCAAG GAGTACAAGT GCAAGGTCTC CAACAAAGCC CTCCCAGCCC 10 8 0 

CCATCGAGAA AACCATCTCC AAAGCCAAAG GGCAGCCCCG AGAACCACAG GTGTACACCC 1140 

TGCCCCCATC CCGGGATGAG CTGACCAAGA ACCAGGTCAG CCTGACCTGC CTGGTCAAAG 12 00 

GCTTCTATCC CAGCGACATC GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG GAGAACAACT 1260 

ACAAGACCAC GCCTCCCGTG CTGGACTCCG ACGGCTCCTT CTTCCTCTAC AGCAAGCTCA 13 2 0 

CCGTGGACAA GAGCAGGTGG CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG ATGCATGAGG 13 8 0 

CTCTGCACAA CCACTACACG CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA TGATAGATAT 1440 
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CT 1442 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GAATTCTGAG CACACAGGAC CTCACCATGG GATGGAGCTG TATCATCCTC TTCTTGGTAG 60 
CAACAGCTAC AGGTGTCCAC TCCCAGTCTG TGTTGACGCA GCCGCCCTCA GTCTCTGCGG 12 0 

CCCCAGGACA GAAGGTCACC ATCTCCTGCA CTGGGAGCAG CTCCAACCTC GGGGCAGGTT 180 

ATGATGTTCA CTGGTACCGG CAACTTCCAG GGACAGCCCC CAAACTCCTC ATCTATGATA 240 

ACAACAATCG GCCCTCAGGG GTCCCTGACC GATTCTCTGG CTCCAAGTCT GGCCCCTCAG 3 00 

CCTCCCTGGC CATCTCTGGG CTCCAGGCTG AGGATGAGGC TGATTATTAC TGCCAGTCCT 3 60 

ATGACAGCAG CCTGAATGGT TATGTCTTCG GAACTGGGAC CCAGCTCACC GTCCTAGGTC 42 0 

AGCCCAAGGC TGCCCCCTCG GTCACTCTGT TCCCGCCCTC CTCTGAGGAG CTTCAAGCCA 48 0 

ACAAGGCCAC ACTGGTGTGT CTCATAAGTG ACTTCTACCC GGGAGCCGTG ACAGTGGCCT 540 

GGAAGGCAAT TAGCAGCCCC GTCAAGGCGG GAGTGGAGAC CACCACACCC TCCAAACAAA 60 0 

GCAACAACAA GTACGCGGCC AGCAGCTATC TGAGCCTGAC GCCTGAGCAG TGGAAGTCCC 660 

ACAGAAGGTA CAGCTGCCAG GTCACGCATG AAGGGAGCAC CGTGGAGAAG ACAGTGGCCC 72 0 

CTACAGAATG TTCATAGTTC TAGATCTACG TATGATCAGC CT 7 62 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Glu Val Gin Leu Leu Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Glu Val Gin Leu Val Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1899 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 14.. 1735 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GGGGCAAATA ACA ATG GAG TTG CTA ATC CTC AAA GCA AAT GCA ATT ACC 49 

Met Glu Leu Leu lie Leu Lys Ala Asn Ala lie Thr 
15 10 

ACA ATC CTC ACT GCA GTC ACA TTT TGT TTT GCT TCT GOT CAA AAC ATC 9 7 

Thr lie Leu Thr Ala Val Thr Phe Cys Phe Ala Ser Gly Gin Asn lie 

15 20 25 

ACT GAA GAA TTT TAT CAA TCA ACA TGC AGT GCA GTT AGC AAA GGC TAT 145 
Thr Glu Glu Phe Tyr Gin Ser Thr Cys Ser Ala Val Ser Lys Gly Tyr 
30 35 40 

CTT AGT GCT CTG AGA ACT GGT TGG TAT ACC AGT GTT ATA ACT ATA GAA 193 
Leu Ser Ala Leu Arg Thr Gly Trp Tyr Thr Ser Val lie Thr lie Glu 
45 50 55 60 

TTA AGT AAT ATC AAG GAA AAT AAG TGT AAT GGA ACA GAT GCT AAG GTA 241 
Leu Ser Asn lie Lys Glu Asn Lys Cys Asn Gly Thr Asp Ala Lys Val 

65 70 75 

AAA TTG ATA AAA CAA GAA TTA GAT AAA TAT AAA AAT GCT GTA ACA GAA 2 89 

Lys Leu lie Lys Gin Glu Leu Asp Lys Tyr Lys Asn Ala Val Thr Glu 

80 85 90 

TTG CAG TTG CTC ATG CAA AGC ACA CCA CCA ACA AAC AAT CGA GCC AGA 33 7 

Leu Gin Leu Leu Met Gin Ser Thr Pro Pro Thr Asn Asn Arg Ala Arg 

95 100 105 
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AGA GAA CTA CCA AGG TTT ATG AAT TAT ACA CTC AAC AAT GCC AAA AAA 3 85 

Arg Glu Leu Pro Arg Phe Men Asn Tyr Thr Leu Asn Asn Ala Lys Lys 
110 115 120 

ACC AAT GTA ACA TTA AGC AAG AAA AGG AAA AGA AGA TTT CTT GGT TTT 43 3 

Thr Asn Val Thr Leu Ser Lys Lys Arg Lys Arg Arg Phe Leu Gly Phe 
125 130 135 140 

TTG TTA GGT GTT GGA TCT GCA ATC GCC AGT GGC GTT GCT GTA TCT AAG 481 
Leu Leu Gly Val Gly Ser Ala lie Ala Ser Gly Val Ala Val Ser Lys 

145 150 155 

GTC CTG CAC CTA GAA GGG GAA GTG AAC AAG ATC AAA AGT GCT CTA CTA 52 9 

Val Leu His Leu Glu Gly Glu Val Asn Lys lie Lys Ser Ala Leu Leu 

160 165 170 

TCC ACA AAC AAG GCT GTA GTC AGC TTA TCA AAT GGA GTT AGT GTC TTA 577 
Ser Thr Asn Lys Ala Val Val Ser Leu Ser Asn Gly Val Ser Val Leu 
175 180 185 

ACC AGC AAA GTG TTA GAC CTC AAA AAC TAT ATA GAT AAA CAA TTG TTA 62 5 

Thr Ser Lys Val Leu Asp Leu Lys Asn Tyr lie Asp Lys Gin Leu Leu 
190 195 200 

CCT ATT GTG AAC AAG CAA AGC TGC AGC ATA TCA AAT ATA GAA ACT GTG 673 
Pro lie Val Asn Lys Gin Ser Cys Ser lie Ser Asn lie Glu Thr Val 
205 210 215 220 

ATA GAG TTC CAA CAA AAG AAC AAC AGA CTA CTA GAG ATT ACC AGG GAA 721 

lie Glu Phe Gin Gin Lys Asn Asn Arg Leu Leu Glu lie Thr Arg Glu 

225 230 235 

TTT AGT GTT AAT GCA GGT GTA ACT ACA CCT GTA AGC ACT TAG ATG TTA 769 
Phe Ser Val Asn Ala Gly Val Thr Thr Pro Val Ser Thr Tyr Met Leu 

240 245 250 

ACT AAT AGT GAA TTA TTG TCA TTA ATC AAT GAT ATG CCT ATA ACA AAT 817 
Thr Asn Ser Glu Leu Leu Ser Leu lie Asn Asp Met Pro lie Thr Asn 
255 260 265 

GAT CAG AAA AAG TTA ATG TCC AAC AAT GTT CAA ATA GTT AGA CAG CAA 8 65 

Asp Gin Lys Lys Leu Met Ser Asn Asn Val Gin lie Val Arg Gin Gin 
270 275 280 

AGT TAG TCT ATC ATG TCC ATA ATA AAA GAG GAA GTC TTA GCA TAT GTA 913 
Ser Tyr Ser lie Met Ser lie lie Lys Glu Glu Val Leu Ala Tyr Val 
285 290 295 300 

GTA CAA TTA CCA CTA TAT GGT GTT ATA GAT ACA CCC TGT TGG AAA CTA 9 61 

Val Gin Leu Pro Leu Tyr Gly Val lie Asp Thr Pro Cys Trp Lys Leu 

305 310 315 

CAC ACA TCC CCT CTA TGT ACA ACC AAC ACA AAA GAA GGG TCC AAC ATC 10 09 

His Thr Ser Pro Leu Cys Thr Thr Asn Thr Lys Glu Gly Ser Asn lie 

320 325 330 

TGT TTA ACA AGA ACT GAC AGA GGA TGG TAC TGT GAC AAT GCA GGA TCA 1057 
Cys Leu Thr Arg Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly Ser 

335 340 345 

GTA TCT TTC TTC CCA CAA GCT GAA ACA TGT AAA GTT CAA TCA AAT CGA 1105 
Val Ser Phe Phe Pro Gin Ala Glu Thr Cys Lys Val Gin Ser Asn Arg 
350 355 360 
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GTA TTT TGT GAC ACA ATG AAC AGT TTA ACA TTA CCA AGT GAA ATA AAT 1153 
Val Phe Cys Asp Thr Met Asn Ser Leu Thr Leu Pro Ser Glu lie Asn 
365 370 375 380 

CTC TGC AAT GTT GAC ATA TTC AAC CCC AAA TAT GAT TGT AAA ATT ATG 12 01 

Leu Cys Asn Val Asp lie Phe Asn Pro Lys Tyr Asp Cys Lys lie Met 

385 390 395 

ACT TCA AAA ACA GAT GTA AGC AGC TCC GTT ATC ACA TCT CTA GGA GCC 12 49 

Thr Ser Lys Thr Asp Val Ser Ser Ser Val lie Thr Ser Leu Gly Ala 

400 405 410 

ATT GTG TCA TGC TAT GGC AAA ACT AAA TGT ACA GCA TCC AAT AAA AAT 12 97 

lie Val Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn 
415 420 425 

CGT GGA ATC ATA AAG ACA TTT TCT AAC GGG TGC GAT TAT GTA TCA AAT 13 45 

Arg Gly lie lie Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn 
430 435 440 

AAA GGG ATG GAC ACT GTG TCT GTA GGT AAC ACA TTA TAT TAT GTA AAT 13 9 3 

Lys Gly Met Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn 

445 450 455 460 

AAG CAA GAA GGT AAA AGT CTC TAT GTA AAA GGT GAA CCA ATA ATA AAT 1441 
Lys Gin Glu Gly Lys Ser Leu Tyr Val Lys Gly Glu Pro lie lie Asn 

465 470 475 

TTC TAT GAC CCA TTA GTA TTC CCC TCT GAT GAA TTT GAT GCA TCA ATA 14 89 

Phe Tyr Asp Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser lie 

480 485 490 

TCT CAA GTC AAC GAG AAG ATT AAC CAG AGC CTA GCA TTT ATT CGT AAA 1537 
Ser Gin Val Asn Glu Lys lie Asn Gin Ser Leu Ala Phe lie Arg Lys 
495 500 505 

TCC GAT GAA TTA TTA CAT AAT GTA AAT GCT GGT AAA TCC ACC ACA AAT 15 8 5 

Ser Asp Glu Leu Leu His Asn Val Asn Ala Gly Lys Ser Thr Thr Asn 
510 515 520 

ATC ATG ATA ACT ACT ATA ATT ATA GTG ATT ATA GTA ATA TTG TTA TCA 163 3 

lie Met lie Thr Thr lie lie lie Val lie lie Val lie Leu Leu Ser 

525 530 535 540 

TTA ATT GCT GTT GGA CTG CTC TTA TAC TGT AAG GCC AGA AGC ACA CCA 16 81 

Leu lie Ala Val Gly Leu Leu Leu Tyr Cys Lys Ala Arg Ser Thr Pro 

545 550 555 

GTC ACA CTA AGC AAA GAT CAA CTG AGT GGT ATA AAT AAT ATT GCA TTT 1729 
Val Thr Leu Ser Lys Asp Gin Leu Ser Gly lie Asn Asn lie Ala Phe 

560 565 570 

AGT AAC TAAATAAAAA TAGCACCTAA TCATGTTCTT ACAATGGTTT ACTATCTGCT 17 85 
Ser Asn 

CATAGACAAC CCATCTGTCA TTGGATTTTC TTAAAATCTG AACTTCATCG AAACTCTCAT 1845 

CTATAAACCA TCTCACTTAC ACTATTTAAG TAGATTCCTA GTTTATAGTT ATAT 189 9 



(2) INFORMATION FOR SEQ ID NO; 20: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 0 : 

Met Glu Leu Leu lie Leu Lys Ala Asn Ala lie Thr Thr lie Leu Thr 
15 10 15 

Ala Val Thr Phe Cys Phe Ala Ser Gly Gin Asn lie Thr Glu Glu Phe 

20 25 30 

Tyr Gin Ser Thr Cys Ser Ala Val Ser Lys Gly Tyr Leu Ser Ala Leu 

35 40 45 

Arg Thr Gly Trp Tyr Thr Ser Val lie Thr lie Glu Leu Ser Asn lie 
50 55 60 

Lys Glu Asn Lys Cys Asn Gly Thr Asp Ala Lys Val Lys Leu lie Lys 
65 70 75 80 

Gin Glu Leu Asp Lys Tyr Lys Asn Ala Val Thr Glu Leu Gin Leu Leu 

85 90 95 

Met Gin Ser Thr Pro Pro Thr Asn Asn Arg Ala Arg Arg Glu Leu Pro 

100 105 110 

Arg Phe Met Asn Tyr Thr Leu Asn Asn Ala Lys Lys Thr Asn Val Thr 
115 120 125 

Leu Ser Lys Lys Arg Lys Arg Arg Phe Leu Gly Phe Leu Leu Gly Val 
130 135 140 

Gly Ser Ala lie Ala Ser Gly Val Ala Val Ser Lys Val Leu His Leu 
145 150 155 160 

Glu Gly Glu Val Asn Lys lie Lys Ser Ala Leu Leu Ser Thr Asn Lys 

165 170 175 

Ala Val Val Ser Leu Ser Asn Gly Val Ser Val Leu Thr Ser Lys Val 

180 185 190 

Leu Asp Leu Lys Asn Tyr He Asp Lys Gin Leu Leu Pro He Val Asn 

195 200 205 

Lys Gin Ser Cys Ser He Ser Asn He Glu Thr Val He Glu Phe Gin 
210 215 220 

Gin Lys Asn Asn Arg Leu Leu Glu He Thr Arg Glu Phe Ser Val Asn 
225 230 235 240 

Ala Gly Val Thr Thr Pro Val Ser Thr Tyr Met Leu Thr Asn Ser Glu 

245 250 255 

Leu Leu Ser Leu He Asn Asp Met Pro He Thr Asn Asp Gin Lys Lys 

260 265 270 

Leu Met Ser Asn Asn Val Gin He Val Arg Gin Gin Ser Tyr Ser He 
275 280 285 
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Met Ser He He Lys Glu Glu Val Leu Ala Tyr Val Val Gin Leu Pro 

290 295 300 

Leu Tyr Gly Val He Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro 
305 310 315 320 

Leu Cys Thr Thr Asn Thr Lys Glu Gly Ser Asn He Cys Leu Thr Arg 

325 330 335 

Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe 

340 345 350 

Pro Gin Ala Glu Thr Cys Lys Val Gin Ser Asn Arg Val Phe Cys Asp 
355 360 365 

Thr Met Asn Ser Leu Thr Leu Pro Ser Glu He Asn Leu Cys Asn Val 
370 375 380 

Asp He Phe Asn Pro Lys Tyr Asp Cys Lys He Met Thr Ser Lys Thr 
385 390 395 400 

Asp Val Ser Ser Ser Val He Thr Ser Leu Gly Ala He Val Ser Cys 

405 410 415 

Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn Arg Gly He He 

420 425 430 

Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn Lys Gly Met Asp 
435 440 445 

Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn Lys Gin Glu Gly 
450 455 460 

Lys Ser Leu Tyr Val Lys Gly Glu Pro He He Asn Phe Tyr Asp Pro 
465 470 475 480 

Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser He Ser Gin Val Asn 

485 490 495 

Glu Lys He Asn Gin Ser Leu Ala Phe He Arg Lys Ser Asp Glu Leu 

500 505 510 

Leu His Asn Val Asn Ala Gly Lys Ser Thr Thr Asn He Met He Thr 
515 520 525 

Thr He He He Val He He Val He Leu Leu Ser Leu He Ala Val 
530 535 540 

Gly Leu Leu Leu Tyr Cys Lys Ala Arg Ser Thr Pro Val Thr Leu Ser 
545 550 555 560 

Lys Asp Gin Leu Ser Gly He Asn Asn He Ala Phe Ser Asn 

565 570 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
15 10 15 
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